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DISCUSSION ON 
‘EDDY-CURRENT LOSSES IN THIN FERROMAGNETIC SHEETS’* 


Mr. A. C. Sim (communicated): The paper gives an expression, 
le to Polivanov, for the ratio of actual eddy-current loss to 
at expected by the usual formulae. This is the series 


96d & coth [(2n — 1)7d] 
tf) 3 (2n at) 1)3 


Et 


(A) 


ie author has plotted this and produced an empirical expression 
ecu kage) (B) 


iere r lies between 1-5 and 1-7, which is claimed to be fairly 
curate for ¢ less than unity, where ¢ represents the ratio, 
l, of domain size to sheet thickness. 
No simple expression is given for the series (A) that is more 
nvenient when ¢ is small, so this is offered here. 
Consider the integral 

| coth ee) tan (77z) ae ©) 
ken Over an infinite circle in the complex z-plane, chosen so 
to avoid any poles. Since the integrand vanishes at every 
int on the circle the integral is zero, and hence the sum of 
sidues may be equated to zero. 
The residue at the pole z = 0 is 


2njn*(1 + 447)/6d 
at at the pole z = $(2m — 1) is 
8 coth [(2m — 1)7¢] 


em =P 
d that at the pole z = jn/2¢ is 
2 
es sa? tanh sraled) 
7H 
Summing these residues for m = 0, ee and 


= +1, +2,..., and equating the_result to zero gives the 
ernative expansion 
48¢3 © tanh (nz/2¢) 
eee ny ne vanntD) 
= 


* Ler, E. W.: Monograph No. 284 M, February, 1958 (see 105 C, p. 337). 
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When ¢ is less than unity this approximates to terms in ¢? 
and ¢° only. By replacing the hyperbolic tangent by an expo- 
nential series and changing the order of summation, a further 
form is found in terms of trilogarithms:* 


7 =[1+ 4d? — 48C(3)¢3/73 
+ (963/273) X (—1)"*!Li,(e-"™*)] ©) 
n=1 
For ¢ less than unity, negligible error arises from 


( — 1) ~ 44241 — (124/73)[ 0) — 2e-7/4]} 


where 12/77? = 0:38 702 and €(3) = 1-202. 

I will not take up space with a graph, but I do not consider that 
anyone plotting formula (F) on double logarithmic paper would 
agree with the range 1-5 to 1-7 for r in the approximation (B). 
The most appropriate range appears to be 1-7—-1-8 for ¢ between 
0-1 and 1-0. 

This is important, of course, should anyone wish to compare 
with experiment by plotting (7 — 1) against d. If a slope of 
1-5 were found, this, I feel, would indicate a significant departure 
from Polivanov’s theory, and could be used to imply a probable 
law of dependence of domain size upon sheet thickness, or else 
a departure from Polivanov’s model. 

Mr. E. W. Lee (in reply): Mr. Sim is, of course, quite right; 
the exponent r given in the paper is incorrect and may be mis- 
leading. My point was that Polivanoy’s theory, which gives a 
smooth curve for 7 as a function of ¢ should not be considered 
inadequate merely because the experimental data happen to fit 
a simple power law, as this itself could easily be a consequence 
of the restricted range of ¢ over which the measurements were 
made. 

Since the paper was written this subject has received attention 
from numerous authors. My present feeling is that the essential 
correctness of Polivanov’s theory is now well established, and 
that a comparison of the theory with experiment would enable 
the domain size to be estimated within a factor of about two. 
By extending the frequency range of the measurements it might 
even be possible to ascertain the distribution of domain sizes. 
Such considerations, however, fall outside the intended scope of 
the paper. 


* Lewin, L.: ‘Dilogarithms and Associated Functions’ (Macdonald, London, 
1958), p. 258. 
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A FREQUENCY-RESPONSE METHOD FOR THE PREDETERMINATION OF 
SYNCHRONOUS-MACHINE STABILITY 


By A. S. ALDRED, M.Sc., Associate Member, and G. SHACKSHAFT, B.Eng., Ph.D., Graduate. 


(The paper was first received 21st November, 1958, and in revised form 23rd March, 1959. It was published as an INSTITUTION MONOGRAPH 


in August, 1959.) 


SUMMARY 


The paper presents a new concept for the predetermination of 
synchronous-machine stability. The concept is based on the realiza- 
tion of a basic closed-loop pattern for a synchronous generator, which, 
when established, can be subjected to the frequency-response procedure 
of the Nyquist stability criterion. The basic closed-loop pattern 
emerges from the application of small-displacement theory to Park’s 
equations for a synchronous generator. The method is applicable to 
a machine with or without a voltage regulator, but is more useful in 
the latter case, and this forms the majority of the applications presented 
in the paper. 

It is shown that, as a by-product of the analysis, expressions for the 
damping in a synchronous machine may be derived. Since this is 
considered to be of some interest, the influence of some of the more 
important parameters of the machine and regulator on the damping 
coefficient is demonstrated. 


LIST OF SYMBOLS 
5 = Rotor angle. 
f, w = Frequency, c/s and rad/s. 
H = Inertia constant, kWs/kVA. 
M = A/180f. 
T;, P; = Torque and power input. 
T,, P, = Torque and power output. 
pO = Speed. 
Vjq = Generator field voltage. 
Ryq = Generator field resistances. 
Ijq = Generator field current. 
Xjq = Generator field reactance, 
Xq = Mutual reactance between generator field and 
direct-axis armature winding. 
Vy, = Way 
Rra 
©, = Field flux linkage. 
® jm = Machine direct-axis flux linkage. 
® jm = Machine quadrature-axis flux linkage. 
Tao = XralRyg = Generator field time-constant. 
K, = Damping torque coefficient. 
V, = Busbar voltage. 
Vam, Vq = Direct-axis machine terminal voltage and busbar 
voltage, respectively. 
Vom, Vq = Quadrature-axis machine terminal voltage and 
busbar voltage, respectively. 
Ty, Ig = Direct- and quadrature-axis currents. 
Xam, Xgm = Machine direct- and quadrature-axis synchronous 
reactances. 
X, = Transmission-line series reactance. 
Xj, X_ = Total direct- and quadrature-axis reactances. 
Xm = Machine direct-axis transient reactance. 
Xj = Total direct-axis transient reactance. 


XaP9, open-circuit excitation voltage. 


Correspondence on Monographs is invited for consideration with a view to 
publication. 

Mr. Aldred and Dr. Shackshaft are in the Department of Electrical Engineering, 
University of Liverpool. 


V,,\= Machine terminal voltage. 
V, = Reference voltage. 
p = Regulator loop gain. { 
Ts, [ts = Stabilizer time-constant and amplification factor, 
7. = Exciter field time-constant. 
Ke Amplification factor in the positive-feedback loo] 


ing. Nog 


(1) INTRODUCTION 


Steady-state stability is normally deduced from consideratic 
of the power-angle curve. This procedure ignores dampim; 
both positive and negative, in the machine and consequent 
does not provide a complete picture or solution of the stabili 
problem. This is particularly true for a machine with a voltas 
regulator, when operating in the dynamic-stability region wil 
positive synchronizing torque, because the movement of f 
rotor, subsequent to a small displacement, is largely dependel 
upon the damping in the system. If positive damping © 
present the rotor oscillations will die away. If there is no dam 
ing the machine will hunt, and if the damping is negative tl 
rotor oscillations will increase in amplitude and may eventual 
cause the machine to become unstable. A more exact metho: 
which has been employed by Concordia,! is to determine 
stability by the application of Routh’s? criterion to the coefficien 
of the characteristic equation of motion of the machine. Becau 
the synchronous-machine characteristic equation is non-linear 
is first necessary to linearize it by the method of small displace 
ments, initially described by Park? in relation to synchronoi 
machines. The Routh stability criterion, however, suffers fro 
the general disadvantage that it provides no information concer 
ing the degree of stability of the machine; this knowledge 
frequently more important than merely knowing whether, uni 
certain operating conditions, the machine is stable or not. — 
was observed by the authors that the logical extension of P. 
small-oscillation theory led naturally to the application of t 
Nyquist* criterion, with its attendant advantages of provi 
information concerning degree of stability and a rapid apprec 
tion of the effects of modifications of the machine equatio 
owing to the addition of, for example, damper windings or 
voltage regulator. The Nyquist criterion is a universally-appli 
closed-loop control-system technique by which stability 
deduced from a graphical plot of the open-loop frequen 
response. This implies the physical existence of a main | 
that can be broken at some convenient point in order to d 
mine the frequency response, either analytically or ex 
mentally. When the synchronous machine is considered fre 
this aspect the question arises as to what is the principal lo 
and what constitutes the feedback. The answers are fo 
from an examination of the small-displacement equation 
motion, which is in terms of either torque or power. Thei 
to the closed loop is the torque from the prime mover AT;, 
the output is the rotor angle Ad. The latter is operated on 
a function which may be written as f(p) in operational fo 
f( jw) in frequency form, to provide the feedback f(jw)A6, whi 
is, in fact, the sum of the damping and synchronizing torqu 


[2] 


may be shown that the addition of a voltage regulator, damper 
ndings or any change in the electrical parts of the machine, 
ily alters the form of the function f(jw) and does not interfere 
{th the basic closed-loop pattern. The essential difference, so 
rt as stability is concerned, between the synchronous machine 
id a conventional closed-loop control’system, is that, in the 
rmer, the main loop cannot be broken, i.e. the feedback is 
herent; consequently the Nyquist criterion can only be applied 
\alytically and not experimentally, unless recourse is had to a 
nulator of appropriate form.>:*7 

‘The objects of the paper are to develop the frequency-response 
ethod and use it to demonstrate the influence of some of the 
ore important factors on the stability of a machine without, 
it primarily with, a voltage regulator. An examination will 
50 be made of the damping in a system, and it will be shown 
fat damping torques can be evaluated as a by-product of the 
equency-response analysis. In the authors’ analysis the sources 
| damping are in the field and the voltage regulator. There are 
her sources of damping that may be important, e.g. positive 
imping arising from the use of a damper winding and negative 
imping associated with armature resistance.- The authors’ 
ethod may be extended to include, if necessary, these additional 
jurces, 


2) SMALL-OSCILLATION THEORY FOR SYNCHRONOUS 
MACHINE WITH VOLTAGE REGULATOR 

The system to be analysed is shown in block-schematic form 

Fig. 1. The operational equations for this system are derived 

Section 8 and are as follows: 


fi! O an peereravemnd ee) bo) 

pole Gp) V; — XDI Merry fos (2) 

Vg EMA ASINGO! Wisi CIM eaMeD NE iG iaei!» & o(3) 
V,=V,cos5 . . Borarnernig: 2) Ht (4) 
| T= = Mp*d + Kzpd + On» ee ts ts 5) 
eee Vy ee ee (6) 
SEE ERS IE ie eT 0p) 
EELS Se ea RE 6:3) 

OE He Xoliyic: si iennd etre we? et (9) 


he frequency-response method is based on the small-oscillation 
juations for a synchronous machine. These are obtained by 
lowing all the variables to change by small amounts from 
e initial operating condition. Steady-state initial-condition 
tages, currents and fluxes are-denoted by the suffix 0. 


(pv | G(P)s(Pygo Xi %q Dao + LioXa Mp? + K. 
ao, — Xia GD bakls loc on 
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TRANSMISSION SYSTEM 


GENERATOR 


EXCITER 


RECTIFIER 


REFERENCE 


(its t J VOLTAGE 


AMPLIFIER COMPARATOR 
Fig. 1.—Basic system of synchronous machine and voltage regulator. 


From eqn. (1), 


AVG = X,Al, DOW RN noice sae ean 0 in ott iraywts (LO) 
From eqn. (2), 

AV, a) G(p)AV; — X,(p)Al, . . . (11) 
From eqn. (3), 

AV, = V, Cos doAS : a 5 . 5 5 (12) 
From egn. (4), 

AV, = — V, sin OGD thew saldiemee 1001 3) 


From eqn. (5), 


AT; = OyoAl, + ToA®q ae D,oAla 
— IyA®, + Mp?Ad + KzpAS . (14) 


Now AY gree ee AD i aati 2s Nace CES) 
and NYS VND pe Wi Gabe 816) 
Therefore 


AT; as G(p)A Vp 4 [T,0Xa(D) ae Do ]Aly 
=e [D2 ae Tao Xq|Al, + Mp?Ad + K,pAS . (17) 


From eqn. (6), 
AV, = g(D)AV in SI 6 emma is eur GUS) 


Substituting for AV,,, by using eqns. (7), (8) and (9) we get 
AV, = e(p)[ViAVa — ViX,Al, + VjAV, + VjX,AL] . (19) 


: V, 
where Valea sepraee od \2 4) RYbHs 20) 
m0 
Vamo 
and Aha ELLA sn Se Ot) 
@ Vino 


If eqn. (19) is substituted in eqns. (11) and (14) the complete set 
of small-displacement equations for the machine and voltage 
regulator may be written in matrix form, as in eqn. (22). 


0 0 


Xq 


—X4(p) 


G(p)g( P) Va G(p)g(p) VX, 


—G(p)g(p) Vax; (22) 


0 


0 


hen eqn. (22) is solved for the ratio AT7;/A6 it yields 


Vie(Xa — XQ) 


4X, cos 28 


T; V, 
5 = Mp? + Kap + x Mocs do + 


— 


+ G(p)g(p) Vp sin 89[ XgnXq%o sin 59 Vg — XgmXaVp COS Sq Va] + Ve sin? 89 X,[Xa(p) — Xa] - 3) 
X,X,[G(p)g(p)VgX, — Xa(p)] 
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The conditions necessary for a continuous sinusoidal oscillation 
may be obtained by substituting p = jw in eqn. (23). For this 
type of solution no disturbing force is necessary, i.e. AT; = 0. 

When this substitution has been made it is clear that eqn. (23) 
reduces to the sum of a real and an imaginary part, both of which 
must be zero for the equation to be satisfied. Equating the real 
part to zero gives an equation which may be solved to determine 
the natural frequency of oscillation of the system. The imaginary 
part reveals the amount of damping (either positive or negative) 
required to cancel that inherent in the system. Some expressions 
for this damping coefficient are as follows: 


(a) Machine without Regulator 
COB tae Tae 
ian oo Xj) Vie sin? do 


K, = ¢ 
1S AEE XP 


(b) Machine with Regulator 


G(p) = 


1 Xq + Xa. so 
1+ ta0P" es BP) = 7 a 
UT go VgXqXam COS Sy — Vi XGmXq Sin So 
— pt (VgXqXqm C08 Sy — ViXimXq Sin So) 
f+ (Xa = Xa) Xz sin Sot gol + w*72)___d 
[(uViX, + Xq — Xytetaqw?)® + wA(T.Xq + Ta0Xa)?] 
(25) 


Xap) = 
V2 sin do 


K,= 


(c) Machine with Regulator incorporating a Positive Feedback 
proportional to Rate of Change of Field Current. 


1 Xq + GaoXa — KXa)p 

Wat XP) a > 

Lie Capes tp 17 Gyo — SP 
sp) = — pt 

in which K is a measure of the positive feedback. 

LT gol ViX Xam cos 8o = Vip dg Sdn sin 5o) 

— pK(V{XqmXq COS 8g — ViX,Xam Sin 59) 

q a5 Ta Xq = XV)Xy sin 0 

(UVGX, + Xq)? + w*(tgoXq — KXz)* 


G(p) = 


Vie sin 8o 


K, = 


(26) 


These expressions for the damping coefficient have been obtained 
as a by-product of the frequency-response method. The damp- 
ing characteristics are considered to be of interest, but as this is 
only an intermediate state in the development of the frequency- 
response method a discussion of the damping is deferred until a 
later Section. 


(3) THE APPLICATION OF NYQUIST’S CRITERION OF 
STABILITY TO THE SMALL-OSCILLATION EQUATIONS 


(3.1) Nyquist Criterion 


The Nyquist criterion of stability is used to predetermine the 
stability or otherwise of closed-loop systems of either single- or 
multi-loop configuration. For single-loop systems the applica- 
tion of the criterion is relatively straightforward. For multi-loop 
systems, however, this is not so, since a situation might arise in 
which one or more of the subsidiary loops in the system may 
cause the system to be unstable when the main loop is opened; 
the presence of the main loop, in fact, is necessary for the main- 
tenance of stability of the complete system. To study the 


i 
r 
general case, consider the closed-loop system of Fig. 2, w 
consists of a forward transfer function, ¢,(p), and a feed 


transfer function, ¢,(p). It is assumed that ,(p)2(p) : 
é 
fi 


Fig. 2.—Fundamental closed-loop system. 


represent either a single- ora multi-loop system. The relatiot 
ship between the controlled quantity and the controll 
quantity is 


6, , OO) 
6, 1+ d(p)d2(p) 


By inspection of eqn. (27) it is clear that, for stability, tl 
function [1 + ¢,(p)¢2(p)] must have no zeros with positive re 
parts. 

To develop the Nyquist criterion use is made of an extensic 
of a theorem in complex variable mathematics due to Cauch 
Cauchy’s theorem states that, if a function f(z) is analytic ¢ 
and within a closed contour on the z-plane, 


$ 1 =0 


The extension of the theorem states that if a function is analyti 
except for possible poles, within and on a given contour, t 
number of times the plot of f(z) encircles the origin of the ff 
plane in an anti-clockwise direction, while z itself moves rouf 
the prescribed contour once in a clockwise direction, is equal ' 
the number of poles of f(z) minus the number of zeros of f( 
within the contour. In stability studies the complex variable 
is, of course, the complex frequency p and the function f(z) is f(7 
The contour which is chosen in the p-plane when stability 
of interest in an infinite semicircle, as shown in Fig. 3. 


(2 


SA +joo 


Fig. 3.—Infinite semicircle. Locus of p for Nyquist criterion. 


reason for choosing this contour is that it encloses any voll 
the function ¢,(p)¢.(p) and consequently of the i 
1 + ¢,(p)¢2(p) which have positive real parts, and also 
zeros of 1 + ¢,(p)¢2(p) with positive real parts. A pole of 
function ¢,(p)¢2(p) with a positive real part implies that 
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stem of Fig. 2 is unstable when the main loop is opened. 
les of the function $,(p)¢,(p) with negative real parts are not 
iportant, since they represent stability of Fig. 2 when the main 
op is opened. The general case is considered in Fig. 3. The 
finite semicircle either encloses poles of $,(p)¢(p) (representing 
‘unstable open loop) or it does not (representing a stable open 
Op). 

Since all physical systems have zero response at infinite fre- 
iency, the movement of the function f(p) in the f(p)-plane will 
zero for infinite frequencies, both real and complex. The 
mplex frequencies represented by the semicircular path between 
jo and +joO are therefore unimportant, since, for these 
quencies, f(p) is stationary. The frequencies which are impor- 
at are the real frequencies between —joo and +joo. 

For practical purposes it is more convenient to consider the 
ot of $,(p)d.(p) for p = — joo to +joo rather than 1 + 
(p)2(p), since the former is readily obtainable as the open- 
op transfer function. Consequently, if this function is plotted, 
¢ extension of Cauchy’s theorem is modified to the statement 
at, for 1 + ¢,(p)¢.(p) to have no zeros with positive real 
rts, the number of encirclements of the (—1, 0) point by 
(p)$2(p) as p varies from —joo to +joo must equal the number 
poles of ¢,(p)¢,(p) with positive real parts. 

If the function $,(p),(p) is such that it is zero or finite when 
= 0 no difficulty is encountered in observing the number of 
t encirclements of the point (—1, 0). In many cases, however, 
p tends to j0 the function $,(p)¢2(p) tends to c/p”, where c 
a constant and m may have values of 1, 2 or 3 in practical cases. 
these cases as p tends to j0, $,(p),(p) tends to infinity, and 
order to obtain values of $;(p)2(p) in this region it is neces- 
ry to plot this function as p moves round the origin in a very 
iall semicircle in the positive half-plane as shown in Fig. 4. 


Fig. 4.—Locus of p near the origin for Nyquist criterion. 


In general, as p tends to j0, substituting the value of p on the 
vall semicircular path gives 


$i(p)bo(0) = ae 


‘om this equation it may be seen that the magnitude of c/As” 
nds to infinity as As tends to zero, and the angle of ¢,(p)¢2(p) 
akes n clockwise semicircles centred on the origin as « passes 
rough the values —7/2 to +7/2. The Nyquist criterion has 
en dealt with at length, because, in its application to syn- 
ronous machines, the closures are important, and under certain 
rcumstances there may be poles of the open-loop transfer 
netion with positive real parts. 


(3.2) The Application to Synchronous-Machine Stability 
Problems 


The small-oscillation equation of a synchronous machine and 
regulator [eqn. (23)] may be written in the form 


(INGO 
= Mp? +f(p) . 


Ad ey 
where f(p) = $(8o) + F(p) (29) 
From eqn. (23), 
VV; OED 6 
b(So) = x © cos 9 + a cos 28) . (30) 
and 
{ G(p)g(p) Vj, sin 8o( XamXqVqVb sin So 
fie Ms XgmXaVgVg008 80) + V5 sin? 5) X,[Xa(p) — Xu] 
XgXq[G(p)g(p)VaX; — Xq(p)] 
(31) 
Rewriting eqn. (28), we get 
AS 1/Mp? 
AT, 1+ f(pMp? ne 


The simple block schematic which represents eqn. (32) is shown 
in Fig. 5. This is the basic closed-loop pattern for a synchro- 


f(p)As 


Fig. 5.—Basic closed-loop pattern for synchronous machine derived 
from small-displacement theory. 


nous generator for small oscillations. It is immediately obvious 
from the diagram that the function f(p)/Mp?, in the denominator 
of eqn. (32), is the open-loop transfer function in operational 
form for a synchronous machine. The Nyquist criterion may be 
applied directly to it for p = — joo to +joo in order to determine 
whether the machine is stable and the degree of stability. In the 
following examples of the application of the frequency-response 
method, the principal parameters for the generator and trans- 
mission line are given in Table 1. Several parameters, such as 
generator and exciter time-constants and the regulator gain, are 
considered as variables except where otherwise stated. 


Table 1 
Xam = 1:0 per unit. Xam = 0-2 per unit. 
= 0-7 per unit. H = 5kWs/kVA. 


qm 
X7 =.0-5 per unit. Vp = 1 per unit. 


(3.2.1) Synchronous Machine without Voltage Regulator. 

In this and the following two Sections the initial condition of 
excitation corresponds to unit terminal voltage at unit power 
output. 

The first example is a simple plot of the open-loop transfer 
function for an unregulated generator for several operating 


angles. The open-loop transfer function is 
f(p) 1 | Vie sin? 39(Xa — | 
On) 2 7 (6% 
Mp? Mp #20) Xq(Xq + XaTaoP) - 
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Fig. 6.—Frequency response of unregulated machine for various 
operating conditions. 


This function is plotted in Fig. 6 and the nature of the closure 
of the curves in the region p =j0 is given with each curve. 
From eqn. (33) itmaymay be deduced that the open-loop transfer 
function has no poles with positive real parts, and therefore for 
stable operation there must be no net encirclement of the point 
(—1, 0). 

The significant feature of this set of curves is that, in the 
region of the point (—1, 0), they are all similar. It is only 
when the curves are closed in the region p = j0 that the difference 
between stable and unstable operation is observed. It is evident 
from eqn. (33) that, if (do) is not zero, the type of closure to 
be applied is determined by the sign of $(59)/M(jw)?. Now 
$(59) is the slope of the power-angle curve for the generator, 
and it therefore changes sign as the peak in this curve 
(89 = 81-5°) is passed. At the peak of the power-angle curve 
#(5o) is zero and the open-loop transfer function is proportional 
to 1/p in the region p =j0. Thus a third type of closure must 
be applied. 


(3.2.2) Synchronous Machine with Voltage Regulator—Effect of Gain 
and Exciter Time-Constant. 

The results in this Section appertain to the basic system of 

Fig. 1. Two important parameters of the regulator are the 

time-constant of the exciter and the regulator gain. In Fig. 7 


Fig. 7.—Frequency response of regulated generator showing effect 
of regulator gain. 


do = 90° Kg=0 
tao = S5sec Te=— UL, — 0 
tT = lsec K=0 


the function f(jw)/MGw)? is plotted for three different val 
p. As wis increased the function approaches the point (—1 
For » = 20 and 30 the number of net encirclements is 
but for 4 = 40 there is one net encirclement. Since there 
no poles of the function f(p)/Mp* with positive real parts, 
curve for w = 40 indicates that self-excited oscillations will occu 

The influence of the exciter time-constant is illustrated in Fi 


3 


Fig. 8.—Frequency response of regulated generator showing 
of exciter-field time-constant. 
3o = 90° uv. = 40 
Tao = 5sec ie See f 


i= — 


which shows that, for tr, = 1-0 and 0-Ssec, the sys 
unstable, whereas, for t, = 2sec, stability is achieved. 
may, at first, appear to be an unusual result, but it is ve 
to some extent by the damping-coefficient characteristics 
Fig. 13. An increase of the exciter field time-constant resuli 
in greater stability has also been observed by Messerle an 
Bruck.!° j 
(3.2.3) Influence of Positive Feedback Proportional to Rate of Cham; 
of Field Current. 
In References 11 and 6 a technique is described for imped 
steady-state and transient-stability boundaries by including i 
the regulator a subsidiary positive feedback proportional to t 
rate of change of field current. For the positive feedbac 
have the desired effect it is necessary to nullify the exciter 
time-constant by a phase-advance circuit. In Section 4 it” 


Fig. 9.—Frequency response of regulated machine showing effect 
positive feedback. 


59 = 60° 
Us =%™% =O 


tao = 5 
w= 40 
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wn that the positive feedback has marked effect on the 
nping in the system, depending on the magnitude of the feed- 
k. It is interesting to observe the corresponding effect on 
Nyquist diagram, as shown in Fig. 9. For a small amount 
positive feedback (K = 0-2) the system is unstable, the fre- 
mcy characteristic showing one net ericirclement of (—1, 0). 
K is increased the frequency characteristics move away from 
point (—1, 0) with zero net encirclements. Fig. 10 shows 


(6) 


@) 


Re 


(A) 


Fig. 10.—Effect of positive feedback on rotor-angle variation. 


(g) 3 


(a) K = 0:75 (e) K = 1-75 
(b) K=1-0 (f) K = 2-0 
fe K = 1:25 (g) K = 2:25 
a)K=1°'5 (A) K =2°5 


corresponding effect on transient stability as the positive 
iback is increased. - 


SOME ASPECTS OF DAMPING IN A SYNCHRONOUS 
, GENERATOR 


| was indicated in Section 2 that the damping inherent in 
system could be cancelled in the equations for small oscilla- 
is by including an equal and opposite damping torque 
Ad in the mechanical equations of motion. The value of 
-equired to make the system oscillate with constant amplitude 
S a measure of the damping in the system for the given 
ial conditions and system parameters. A study of the 
iping is useful in that it throws some light on the mechanism 
tability and is also useful for purposes of comparison. The 
iping torque coefficients may be calculated from eqns. (24), 
) and (26), or, if the equations of the system are set up on an 
logue computer, the values of K, required for small oscilla- 
is to exist may be obtained experimentally. The results 
sented in this Section were obtained by the latter procedure 
checked in many instances by mathematical computation. 
he generator and transmission-line parameters employed in 
3¢ experiments are given in Table 1. 


(4.1) Unregulated Generator 


The initial operating point of the machine is chosen to be 
the rated terminal voltage at rated power output. In Fig. 11@) 
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Fig. 11.—Influence of operating condition on the damping coefficient. 


(a) No regulator. 


Experimental. 
—--- Theoretical. 
(6) With regulator. 
Experimental. 
tao = 5 uw = 40 K=0 
T=2 Us=t3=0 


the damping torque is plotted as a function of rotor angle. At 
do = 0 the damping is zero and the machine hunts continuously. 
As the rotor angle increases so does the damping, and this is 
always positive. 

The effect of the generator time-constant is shown in Fig. 12(a). 
From eqn. (24) the damping torque is seen to be zero both for 
infinite and zero field time-constants. Fig. 12(@) shows that, 
over the range of Tz) considered, the damping torque decreases 
aS Tgo is increased. A maximum value for K, exists between the 
values Tg = 0 and 3. 


(4.2) Generator with Voltage Regulator 


The initial operating point at which the regulator is set up is 
again chosen to be the rated power output at rated terminal 
voltage. The first test in this study is to measure the damping 
coefficient as a function of rotor angle, and the result is shown 
in Fig. 11(6). The damping torque is observed to be slightly 
greater than that for the unregulated machine. The effect of 
the generator field time-constant is shown in Fig. 12(b). The 
damping coefficient is again slightly greater than that for the 
unregulated machine. Eqn. (25) indicates that the damping is 
zero both for infinite and zero generator-field time-constants. 
The curves of Fig. 13, showing the effect of exciter-field time- 
constant on the damping coefficient, demonstrate that the 
damping in the system may become negative for small exciter 
time-constants. This is an interesting result since it suggests 
that, for a finite Tz, there is a lower limit to 7, for optimum 
damping. 
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Fig. 12.—Effect of generator-field time-constant on the damping 
coefficient. 


(a) No regulator. 
(6) Regulator present. 


Te = Us = Ts = 0 
u = 40 K= 
So = 50° 
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Fig. 13.—Damping coefficient as a function of exciter-field 
time-constant. 


Td = 5:0 
wu = 40 


Us =t-=0 
K=0 


The curve of Fig. 14(@) shows how the regulator gain affects 
the damping. Changes in regulator gain appear to have little 
effect on the damping coefficient. 

The effects of the gain and time-constant of a stabilizing trans- 
former on the damping coefficient are shown in Fig. 15. 


(a) x10°9 
(6) x1072 


COEFFICIENT, PER-UNIT POWER SEC/RAD 


DAMPING 


1e) 40 860 120 
REGULATOR GAIN 
Fig. 14.—Damping coefficient as a function of regulator gain. © 


(a) Without positive feedback. 
Tao = 5 Vs=ts =O 8 = 50° , 3 
ai — OO. 2% 


(6) With positive feedback. 

tao = 5 K=2 
Us=Ts=0 8 = 50° 
Operation at 5) = 90 (P; = 1-6) in the dynamic stability region 
is considered in this test, since the effect of a stabilizer is generally 
most noticeable in this region. It may be observed that fo) 
curves (b) and (c) of Fig. 15 the damping changes sign a 
stabilizer gains of 6-5 and 7-0, respectively. Thus stabilize} 


Qa 
<q 
a 
— 
VU 
uJ 
Ww r) % 
rm A 
2 4 
fe) | 
a Ps 
t ai 
Zz 
? 
Ha) 
a 12 
s STABILIZER GAIN - 
z 7 
iW A 
12) | 
t 4 
Ww +| 
On6 a 
| 
g a 
g P 
q 4 
a 5 
i 7 
| 
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n is a critical parameter for the correct functioning of a 


bilizing transformer. 


3) Generator with Voltage Regulator incorporating a Positive 
Feedback Proportional to Rate of Change of Field Current 

The positive feedback is identical with that described in 
ferences 11 and 6. The effect of a positive feedback on the 
nping coefficient for various rotor angles is shown in Fig. 16. 
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lig. 16.—Damping coefficient as a function of rotor angle, with 
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positive feedback. 
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‘ig. 17.—Effect of positive feedback on the damping coefficient. 
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By comparing this curve with that of Fig. 11, the positive-feed- 
back loop is seen to bring about a very marked increase in 
the damping coefficient. Fig. 16 also shows that the damping 
coefficient is reduced as the load angle advances beyond its 
normal (i.e. with fixed excitation) maximum. Fig. 17 shows 
how the damping coefficient is affected by the positive feedback. 

The effect of the regulator gain is shown in Fig. 14(6). From 
this graph and also from eqn. (26) the damping coefficient may 
be seen to be approximately inversely proportional to the 
regulator gain. 


(5) CONCLUSIONS 


The small-oscillation study reveals how the various com- 
ponents of the regulator loop affect the damping inherent in 
the synchronous generator. 

The open-loop frequency-response method for predeter- 
mining mathematically whether or not a system is stable, in 
addition to providing information concerning its degree of 
stability, is considered to be an improvement on previous 
analytical work on synchronous machines in this field. The 
technique enables stability problems to be tackled with no more 
apparatus than pencil and paper. However, a word of caution 
is necessary to qualify this statement. The authors have stated 
elsewhere that they have experienced difficulty in inserting the 
appropriate initial conditions into small-oscillation equations. 
This observation is again emphasized. The difficulty can be 
obviated by measurement of initial conditions on the machine 
or by the use of a computer. Failing this the authors have 
found the voltage-excitation characteristics of Fig. 6 in Reference 
6 to be most useful for determining the necessary initial 
conditions, and the work described has been framed around these 
characteristics. 

The authors believe that the frequency-response method 
should prove of value in the design of damper windings, since the 
effect of variations in damper-winding resistances and reactances 
on stability may be conyeniently observed on the Nyquist 
diagrams. 

Finally, it should be noted that the small-oscillation method 
specifically restricts the changes in the system to be vanishingly 
small. Thus it is not strictly correct to interpret small-oscillation 
results as being directly applicable to conditions of large oscilla- 
tions, although the authors believe that there is a correspondence 
between the two. 
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(8) APPENDIX 


(8.1) Operational Equations of a Synchronous Machine and 
Voltage Regulator 


The system to be analysed is shown in block schematic form 
in Fig. 1. 

Park’s® equations for an ideal synchronous machine,’ with no 
amortisseur windings, connected to an infinite system via a 
transmission line are as follows: 


Direct axis Vien = PP am — PgmP? — Rng (34) 
Quadrature axis Vim = DPgm + Damp? — Rmlg (35) 
Field Veg = P® eq + Realy (36) 
in which Dig = Xpalja — Xaala (37) 
i Ki gli Mg ta. (38) 
ape Bini a (39) 

The generated torque is 
Ty = Daly — Ponlig (40) 


The equations for a transmission line in Park’s reference frame 
are as follows: 


Vy = px, a PO — Rly Seen . . . (41) 

Vy = pQ,, ai D,,p0 a Rl, =F Vern (42) 

inwhich ®,, = — XI, (43) 
®,, = — X,], (44) 


The suffixes m and ¢ indicate machine and transmission line 
quantities respectively. The d and g equations for the machine 


may be added to the d and qg equations for the transmission in) 
to give 


Va = p0z— Op) — Ri, . 
V, = p®, + D,pb — 

in which a, = D im SP Oi, 
OD, = Dyn + Oot 

and R= Ros poe 


The direct- and quadrature-axis components of the bi sbe) 
voltage are as follows: 


Vz = V;, sin re) P 
ay = V,cosé. 
The equation of motion of the system is 


d*6 dd 
T, = Mon + Kee + 0,1, — OJy 


The following assumptions are also made: 


(a) The speed change during a disturbance, which results in th 
machine remaining in synchronism, is a negligible fraction of H 
mental speed, i.e. p# is assumed constant and in the per-unit syster 
it is equal to unity. 

(6) Armature and transmission-line resistance is negligible. 

(c) The voltages induced in the armature by the rate of chang) 
of armature flux linkages, namely peg and pQ,, are negligible con 
pared with the voltages, D,p$ and Dgpé, generated by the flu Xe 
@q and ®, rotating at fundamental speed. 


With these assumptions the equations of a synchrono 
machine and transmission line reduce to 


Wj=aX ows oe 
¥,=GOV;- XO, . ae 
V,=V, sind. oo 
Vi = Vsc0s 8. ee . ail 


T, = Mp?8 + Kyp8 + O4l, <ite . . 
The operational expressions for G(p) and X,(p) are as follows; 


1 
CO) Seep 
_ Xa + Xata0P 
110) ieee 


For the voltage regulator in Fig. 1 the equations are as follows : 
Ve =8(P) [Vm — V,) - (60 


V2i=Vin+Ve,. . . nn 
Vim = Va— X4lq (62 
Vom = Vq + X,Ig (63 
where 2(p) = Ft hoa & (4 


a 


q 
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ANALOGUE TREATMENT OF EDDY-CURRENT PROBLEMS INVOLVING 
TWO-DIMENSIONAL FIELDS 
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July, 1959.) 


SUMMARY 

The paper deals with the RC-analogue representation of electro- 
nagnetic fields involving unidirectional current flow, such as those 
issociated with current-carrying conductors lying in armature slots. 
[he resulting magnetic field is in general 2-dimensional and is fully 
epresented in the analogue. Capacitor current in the latter is directly 
quivalent to current flow in the corresponding section of the con- 
juctor, and the eddy-current loss in a conductor may be calculated 
rom the distribution of current between the several capacitors. Some 
imple examples illustrate the translation of field boundary conditions 
nio analogue terms. 


LIST OF SYMBOLS 


E, E, = Electric field strength, V/m. 
H, H,, H, = Magnetic field strength, A/m. 
2 = = Current density, A/m?. 
i = Total current in a single conductor, amp. 
@® = Magnetic flux per unit length in z-direction, 
Wb/m. 
o = Conductivity, mhos/m. 
jt = Permeability, H/m. 
w/27 = Frequency, c/s. 
8 = +/(2/wpo) = Skin depth in conductor, m. 
Rpc = Resistance of conductor by d.c. measurement, 
ohms. 
Rac = Effective a.c. resistance due to eddy currents, 
ohms. 
hk = Interval in finite difference approximation, m 
K = Scaling factor, m/s. 
R = Analogue resistance, ohms. 
C = Analogue capacitance, F. 
I,, I, = Analogue resistor currents, amp. 
. J, = Analogue capacitor currents, amp. 
V = Potential in resistor plane, volts. 
V’ = Potential of common capacitor junction in con- 
ductor representation, volts. 


lo, Ti, . 


(1) INTRODUCTION 


i The non-uniform distribution of current in conductors 
embedded i in an armature slot has been determined analytically 
for a simple geometrical arrangement by several authors.!3 

vided that the slot has a rectangular cross-section of small 
idth/depth ratio and that the conductors are of rectangular 
tion and substantially fill the width of the slot but do 
approach closely its open end, it is a fair assumption that 
magnetic flux paths are straight lines normal to the sides of 
he slot. Under these conditions the current density is a function 
of only one space co-ordinate, namely depth in the slot, and 
evaluation is a straightforward solution of the 1-dimensional 
iffusion equation, with appropriate boundary conditions. 


_ Correspondence on Monographs is invited for consideration with a view to 
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Recent trends in machine design have produced several 
examples of conductor and slot geometry which do not fulfil 
these conditions. For instance, an outstanding advance has 
been made in respect of increased power rating of machines of 
given overall dimensions by the introduction of direct cooling of 
hollow armature conductors. The magnetic flux pattern in such 
a case is not likely to be as simple as that assumed in the 1-dimen- 
sional case, and furthermore the current density will be influenced 
by the variation of effective conductor width with depth in the 
slot. Even so, the flow of current is along the length of the 
conductors and the magnetic field lines are confined to planes 
normal to the conductor length. The field equations in this case 
are particularly suited to a simple analogue treatment,*> and the 
current density in the conductors has a counterpart in the 
analogue which may conveniently be measured. 


(2) FIELD EQUATIONS FOR UNIDIRECTIONAL CURRENT 
FLOW 

The field equations will be set up here for a continuous 

medium of conductivity o and permeability , taking account 

of the simplifications appropriate to this particular problem. 

Assuming sinusoidal time variation of all field quantities at a 

frequency w/27, and ignoring displacement current, which is not 

significant in this case, Maxwell’s equations take the form: 

curl Wi— J (1) 

curl E = — jwuH (2) 

J and E£ being related by 

J GCE 


The time factor, exp jwr, is omitted for ease of writing. 
Choosing the z-axis as the direction of current flow in the 


medium, J and E will only have components J, and FE... Hence 
eqns. (1) and (2), in component form, reduce to: 

be ee thy @) 

of = — jopol, (4) 

S: = jouoH, (5) 


It is clear that, with the assumption of unidirectional current 
density, J,, there is no z-component of magnetic field strength 
and no z-dependence of any field quantity, ie. the field is 
essentially 2-dimensional. Elimination of H, and H, from 
eqns. (3), (4) and (5) shows that J, must satisfy 


Wy, WU, 


eed (6) 


=jwpos, . 


[11] 
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(2.1) In Finite-Difference Form 


Consider the xy-plane in the continuous medium divided into 
square elements, the length of side h being so small that h 
multiplied by the current density at the centre of an element is 
a good approximation to the total current flowing through this 
element. Fig. 1 shows one such element whose centre-point, 


Fig. 1.—Cross-section of continuous medium perpendicular to the 
direction of current flow. 


Dotted lines mark the boundary of a square element of side A used in the finite- 
difference approximation. 


0, is the origin of the x- and y-axes. Points 1, 2,3 and 4 are the 
centre-points of neighbouring elements. Using Maclaurin’s 
theorem, the current densities at points 1 and 3 may be expressed 
in terms of J, and its derivatives at 0 as follows: 


Jn a Jo 
eda sale Oe pee | a eee 
+H] 4 El I, ; 1S eeeey) x4 f ieee 
(7a) 
and Jj, = Jz 
ARSE ene OPI aia ore ame a ae 
‘Lax o 2 32 |, 6 Lox? |o " 2A dx4 6 
(7b) 


where the numerical subscripts identify the location at which the 
field quantities are applicable. 
Adding eqns. (7a) and (7b), 


yy, 
Jz + Jz3 a 2Jz0 i we I 


ht| oJ, 
provided that / is so small that i ae “], is negligible. 


Wy, ; 
Hence sx? = (J, + Iz — 2Iz9) [hk (8a) 
Hae (int 
Similarly ; sae S(CLs ate J24 — QI) | h (8b) 


Replacing the second derivatives in eqn. (6) by the finite-difference 
approximations of eqns. (8a) and (84) gives 


Jay + Jeng + Je3 + Jig — Ag = jwpch*T . . (9) 


The first space derivatives of the field quantities may bi 
approximated in the same way. 


0H, * 
Ea = {[4,]>— 
Py 
Ea = Jz, — Jzo)/h, ete. 


where the subscripts P, Q, S and T refer to the values of thi 
field quantities at the centre-points of the sides of the elemen 
as shown in Fig. 1. Eqns. (3), (4) and (5) may therefore b Hl 
expressed i in the following finite-difference forms: | 


Wg = WX [yl — [Hyls + (dr ee 
Hh[ Hy lo ="F9 — hJ,»)|jwoh* 
ph[H|r = (H2F-4 — WJ,9) iol? . q 
uh[H,]p = (WJ, — )liwch?. . . (12al_ 
LA[Hy|s = (hJ9 — WJ,;)[jwch?. . . Ol 


Thus [H,] shh, etc. 


and 


(3) EQUIVALENT NETWORK 


Fig. 2 shows a portion of a network consisting of equal 5 
resistors, R, connected in a square mesh pattern and lying in 4 


Fig. 2.—Portion of a network analogue for a continuous conducting 


medium. 


single plane. From each resistor junction a capacitor, C, 4 
returned to a common point of alternating potential V’. In the| 
diagram the resistors are divided into units of R/2, since it is} 
intended to show the equivalence of the element of the network| 
delineated by the dotted line to the square element of side A in| 
Fig. 1. Alternating voltages are applied to the network at some] 
remote points, as yet unspecified, giving rise to the distribution 
of voltages and currents indicated in the diagram. | 
The following relationships between the voltages and currents 


of Fig. 2 are apparent: sp 
Vo—V, = [LX ]pR , 
Vo— 6 = [L]oR 
(13) 
V3 — Vo = [I,]sR 
V4— Vo = [L]rR 


Ip = jwC(Vo — V’) 
I, =jwC(V, — Vv’) 


fy = jaVs— V9 | 
Equating the currents at point 0, 
| a = [Jp 4 Jo ca [L.]s siz [Lr 
=(V;, + Vo + V3 + V4 — 4V_)/R [from eqn. (13)] 
=+5h4+44+ 4, — 4)/jwCR [from eqn. (14)] 
‘ie. ee i 2 7, “4%, = joCRhy (15) 


Comparing eqns. (9) and (15) it is seen that the capacitor 
current in the network satisfies the same difference equation as 
| the current density in the original field, and the network is a 
true analogue, provided that 


wCR = wych* (16) 


Multiplication of eqn. (9) throughout by h?, the area of each 
\element of the field, shows that the total current per element 
also satisfies the difference equation. \.The capacitor current will 
| therefore be taken to represent the total current crossing the 
‘corresponding elemental area by putting 


I ma h? Fo 


| (17) 
Expressing J) in terms of analogue potentials, from eqn. (14), 
| FCW, = V) Sek, 

C =h’olK (18) 


| s i ; 5 _ 

| where K is some convenient scaling factor carrying the dimen- 
sions metres per second, 

| 


| 
By setting up 


| Vo — V’ = KEplio . (19) 
| The choice of R is now settled by eqns. (16) and (18), 
re R=Kp (20) 
| From egns. (13) and (14), 

Rilo = Vo — V2 = Uy — E)jwC 


‘Comparing this with eqn. (11a), it is clear that the foregoing 
relationships produce the following equivalence: 


1, = hH, 


(21a) 
Similarly Wee at Vion Bi Se 500969200210) 


Several general features of the analogue may be noted from 
_ these equations: 


(a) Resistance and capacitance in the analogue correspond to 
permeability and conductivity respectively in the medium. 

(b) From eqns. (21a) and (216) it is clear that a resistor current 
indicates the line integral of magnetic field strength, or m.m.f., in the 
xy-plane along the side of the element normal to the resistor length. 

(c) Eqn. (19) shows that the potential difference across a capacitor 
is a measure of the electric field strength at the corresponding point 
in the medium, the phase quadrature arising as a result of the choice of 
analogue components. 

(d) From eqn. (13), 

V; — Vo = — UxlpR 

= Ku[Hy]ph. [from eqn. (215)] 

SURO SOS St ee ee 0279) 
where ®10 is the magnetic flux swept out per metre displacement in the 
z-direction of the line joining points 0 and 1. Consequently points in 
the resistance plane at the same potential define an area in the medium 
having zero normal component of magnetic flux, i.e. equipotential lines 
in the analogue correspond to magnetic flux lines in the original field. 


FA 
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(4) BOUNDARY CONDITIONS 


Since the foregoing analysis has dealt specifically with a con- 
tinuous medium, the first point which requires attention is the 
means and justification for connecting the analogue representa- 
tions of separate media at a common interface. This will be 
illustrated by reference to Fig. 3, which shows the analogue 


as 


Fig. 3.—Network analogue for long straight copper conductor, of 
rectangular cross-section, in air. 


Dotted line denotes the conductor boundary. 


simulating a long straight copper conductor, of rectangular cross- 
section, inair. The copper is represented by the mesh enclosed by 
the dotted line, a suitable interval h being chosen bearing in mind 
the finite-difference approximation, and K being so chosen that 
reasonable values are obtained for R (= Ku) and C (= h?o/K). 
Since air is non-conducting, no capacitors appear in the circuit 
outside the dotted line, and the value of the resistors is Kp, 
where [Zo is the free-space permeability constant. Note that the 
resistors here have the same value throughout the network since 
the relative permeability of copper is unity. 

The direct connection of the two parts of the analogue at the 
dotted line imposes two conditions on the network. In the first 
place the potentials at the dotted line have been equated, which, 
in terms of the original field, implies equating the E, values for 
the two media at the interface, since K has the same value 
throughout the analogue [compare eqn. (19)]. Secondly, the 
continuity of current flow through the resistors crossing the 
dotted line effectively equates the tangential components of 
magnetic field strength at the interface, as seen in eqn. (21), for 
the same interval / in the two media. The direct connection is 
thus justified, since the continuity of the tangential components 
of both the electric and magnetic field strengths at an interface 
is a well-known property of the electromagnetic field. 

To show the way in which a.c. excitation is applied to the 
analogue the specific case will be considered of the conductor 
in Fig. 3 carrying a total current 7. From the initial premise of 
eqn. (17), iis the total current leaving the network at the common 
capacitor junction; this junction is consequently one of the 
supply points of the analogue and has an indicated potential V’. 
To get a complete representation of the field due to this current- 
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carrying conductor, the current will have to be supplied to the 
analogue at points infinitely remote from the conductor. It is 
clear, however, that this current can equally well be supplied 
to the analogue by joining together points lying on any equi- 
potential and raising this common lead to a potential Vz sufficient 
to produce the current i, provided that this equipotential encloses 
completely all parts of the field of interest. In particular, if 
only the current distribution in the conductor is required, any 
equipotential which lies wholly outside the dotted line may be 
chosen: in field terms, this means that the magnetic flux external 
to a conductor plays no part in determining the current distribu- 
tion within it. Since equipotentials in the analogue correspond 
to lines of magnetic flux in the original field, the choice of an 
equipotential requires prior knowledge of the flux pattern in 
that region of the field. In the case of the isolated conductor, 
the lines of magnetic flux will approximate to circles as the 
distance from the conductor increases. It would therefore be 
reasonable here to choose circles of increasing radii until further 
increase produced no apparent change in the region of interest. 

To examine further the significance of potential division within 
the network, suppose that the analogue is infinite in extent and 
that an alternating potential V.. is applied to infinitely remote 
points of the network. Under this excitation let the potential 
of some mesh point M within the dotted line be Vj,._ Integrating 
eqn. (22) from M to infinity, 


Vo — Vi = KOy, 


where ©, is the total flux interlinking unit length of the z-directed 
line through the corresponding point M of the conductor cross- 
section. Hence 


joVn — VK = jwoPy 


ie. the inductive voltage drop per unit length of the conductor at 
M. Also, if J, is the current through the capacitor connected to 
mesh point M, and J,,, and E,,, are the current density and 
electric field strength at point M in the conductor, 


jo(Viy — VK = Iy|/KC 


= £2M 


which is the resistive voltage drop per unit length of the conductor 
at M. 

It is seen that jw(V.. — V’)/K is therefore the total applied 
voltage per unit length of the conductor. In the case considered 
here of a single conductor, no significance attaches to V’, since 
the analogue currents depend solely on the total applied voltage 
Vio — V’. However, if several conductors are present it will be 
necessary in general to raise the common capacitor junctions of 
the separate conductors to different potentials (i.e. different 
values of V’), since the applied voltage per unit length will not 
usually be the same for each conductor. If the applied voltage 
per unit length is the same for each conductor, V’ will be the 
potential of all the capacitor junctions: this clearly corresponds to 
a parallel connection of the conductors, in which case the separate 
conductors can be regarded simply as parts of a single conductor. 


(5) CONDUCTORS IN AN ARMATURE SLOT 


Fig. 4 shows an arrangement of four conductors in an armature 
slot (a) and the corresponding analogue (5). Here there is the 
additional complication of an iron circuit. The dotted lines in 
Fig. 4(a) representing lines of magnetic flux are intended to con- 
vey only the flux curvature near the mouth of the slot, the sym- 
metry of flux distribution about the vertical centre line of the slot 


TO ROOT OF 
SLOT 


‘ f 
(ec) (d) 


Fig. 4.—Analogue representation of current-carrying conductors 
an armature slot. 


(a) Cross-sectional view showing arrangement of four conductors in a slot. 
Lines of magnetic flux. 


(6) Network analogue. Fai 
(c) Arbitrary line, PQ, across slot. 
(d) Corresponding line, P’Q’, in analogue. 


( c E Capacitors connected at heavily do! 
points are omitted for clarity. 


and a pictorial indication of flux closure through the iron ciret 
In the analogue representation the usual assumption of infin 
permeability of the iron has been made, leading to resistors 
infinite value in the analogue, i.e. the iron face is an open-circuii 
boundary. Expressed differently, there is no tangential magne 
field at the interface and consequently no current flow perpen 
cular to the boundary in the analogue. “ 

If the magnetic flux paths in the iron are of interest, | 
representation of this part of the field for a solid-iron armati 
will follow the lines indicated in previous Sections. If 
iron is laminated, as is usual, with the laminations lyi 
in planes z = constant, then current flow in the z-directi 
is suppressed and the field quantities are not strictly invari: 
with z. However, with effective lamination, the flux dens 
in the iron will not vary appreciably with z, and the she 
fall-off of flux density in the intervening insulating layers 
maximum thickness 0-002in) will not be apparent in the s 
at distances from the slot wall greater than the thickness of | 
insulating layer approximately. In such a case an adequate rep 
sentation in the analogue is a resistor network without capaci 
loading, the value of the resistors being K times the aver: 
weighted permeability of the laminations and intervening in 
lating layers. 

The insulating layers between the conductors in the slot : 
assumed to be of negligible thickness. Their sole function 
isolating the conductors is then catered for by isolating | 
common capacitor junctions of the several conductors. 17 


k Beanuity of tangential magnetic field strength through the layer 
is represented by the continuity of current flow through the 
resistors normal to the boundaries in the analogue. 

As an example of setting up boundary conditions in this 
arrangement, the case will be considered in which the conductors 
‘are fully transposed, i.e. each conductor occupies each of the 
four possible positions in the slot over an equal length. This 
means that each conductor carries the same current i. Conse- 
quently, in the analogue the remote lines A’B’ and C’D’ are 
raised to a potential Vg, and V;, V5, V3; and V4 are so adjusted 
‘that equal currents leave the network through these connections. 
‘The lines A’B’ and C’D’ are chosen symmetrically disposed with 
‘respect to the slot, since their counterparts AB and CD in the 
original field must lie on the same closed line of magnetic flux; 
‘moreover, A’B’ and C’D’ are sufficiently remote from the slot 
that they can be judged with reasonable accuracy. If the top 
eenetor is so far from the mouth of the slot that it does not 
enter the field fringing region, a single line of application of the 
yoltage V;, such as F’G’ can be used. 

As a check on the validity of the representation and, in par- 
ticular, that the current distribution throughout the conductors 
is truly portrayed in the analogue, consider any arbitrary line 
PQ across the slot and its counterpart P’Q’ in the model [Fig. 4(c) 
and (d)]. Any elementary length ds of P’Q’ may or may not 
intercept resistors carrying current in the analogue. The 
probability of its intercepting an x-directed resistor is dy/h, and 
of its intercepting a y-directed resistor is dx/h. The mean 
‘current crossing such an element towards the root of the slot is 
therefore 


dy dx 
ne a 


‘Hence the total current crossing P’Q’, ie. the total current 
Teaying the analogue through capacitors connected on the root 
‘side of P’Q’, is 


i i ay its = 


' os t (H,dy + H,dx) [from eqn. (21)] 
rs PO y 


I 


=—|A.ds 
PQ 


=| A.ds 
OP 


the line integral of magnetic field strength from Q to P. 

Since the magnetic circuit is completed through a zero- 
sluctance path around the slot root, this line integral is equal 
to the total z-directed current flowing through conductors, or 
tarts of conductors, lying on the root side of PQ. The choice 
f PQ has been quite arbitrary; hence the equivalence of con- 
‘ductor currents and capacitor currents throughout the conductor 

ion has been established. 


_ (5.1) Calculation of Eddy-Current Losses and Effective 
Resistance 


By measuring the current through individual capacitors, the 
ent flow through the corresponding elements, of area h”, of the 
onductors is determined. Fixing attention on one particular 
onductor, let its cross-section be represented by N square 
ements in the analogue. In the absence of eddy currents, the 
ohmic loss per metre in this conductor is given by 


4?Rpc = 3i?/Noh* 
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where Rpc is the value of resistance per metre obtained by d.c. 


measurement, and 7 is the peak value of the total conductor 
r=N 
current, i, given by i= >} 
i 

the rth capacitor associated with this conductor. 
The resistance per metre offered by the conductor to the 


current through each element is 1/ch?. 


I,, where J, is the current through 


r=N 
Hence, the actual power loss = §} 4£2/ch? 
nie 
1 
~ Joh? 


If this power loss be written in terms of an eff ective a.c. resistance 
Rac, as 47 Rac, 
r=N 
pes 1A 


r=1 


Ric Aah kc 
Roc ga 

x I, 

r=1 peak value 


(6) COMPARISON OF UNIDIMENSIONAL ANALOGUE 
RESULTS WITH ANALYTICAL FIELD SOLUTIONS 

Although the value of this analogue technique rests mainly 
on its applicability to problems in which the time-varying mag- 
netic field is 2-dimensional, a few 1-dimensional problems will 
now be considered, partly to show the use of different boundary 
conditions and how they are translated into analogue terms, 
and also by comparing the solutions with known analytical 
solutions to give an indication of reasonable values of the 
interval h. 


then (23) 


(6.1) Semi-Infinite Solid 


The eddy-current loss and flux penetration will be determined 
for a semi-infinite solid conductor, or infinitely thick plate, when 
it is subjected to an alternating magnetic field tangential to the 
surface. The surface is taken as the plane x =0, and the 
y-axis as the direction of the magnetic field of strength Ayo 

A well-known result of the analytical solution of this problem 
is that the current density and magnetic field strength decrease 
with increasing depth in the solid, being reduced by a factor 
1/e in a distance 6 = +/(2/wyc), known as the skin depth or 
depth of penetration. If the interval h chosen for the analogue 
is expressed in terms of 6, a solution is obtained which is 
applicable to any conducting solid. Let A = 8/2: then, from 
eqn. (16), 


wCR = woh =i s (24) 


It will be found unnecessary to specify C and R separately. 
The analogue representation of this problem simplifies to a 
ladder network of infinite length, as shown in Fig. 5, since there 


Fig. 5.—Network analogue for a semi-infinite conductor subjected to 
a uniform alternating magnetic field tangential to its surface. 
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is no y-variation, and each network section refers to an element 
of the conductor of cross-section h x fA in the xy-plane. To 
represent the field in the conductor fully the voltage should be 
applied to the network between terminal A and an infinitely 
remote point along the resistor chain, but since no current will 
flow through capacitors at infinite values of x, the remote 
potential is that of terminal B and the voltage is consequently 
applied across AB. The given boundary condition in this 
problem is the value Hy of the magnetic field strength at x = 0. 
Since, from eqn. (21D), 


High = — HSI? (25) 


the applied voltage must be so adjusted that the input current, 
I,o, to the line network is given by eqn. (25). 
Analysis of this circuit shows that 


Mey Se 


Tu 1 == ip — jd: 
—— epnihs ee ee 
2 zB a 2 | : 41 [from eqn. (24)] 

= 0-601 , — 28° 
7 haere Lee 


where I,, I, ... J,, I-41, .. . are the currents through the 


capacitors! in ithe dsta2nd, ey = ithe ae) thse ieesections: 
Also a = 0°469 + j0-282 = 0-5477/31° 
x0 ert 
Hence 
= (co 62 
Df? = A975 0-547) 
r=1 4 


[1 + (0-601)? + ©-601)* + (0-601) + ...] 
yo 8  0800 
4 1 — (0-601)? 


62 


The capacitor current J, is equal to the total z-directed current 
through the rth conductor element of cross-sectional area h?, 
the elements being numbered from the surface. Each current 
consequently experiences a resistance of 1/ah? = 4/06? per unit 
length in the z-direction. Hence the total loss at all depths of 
the conductor below a surface area of 1 x 6/2 is given by 


r=0 4 
4 >» a) = 0:469A2)/20 


Therefore the total loss per unit surface area = 0-469A?)/o8. 
This is to be compared with the analytical value of H?,/208. 
The discrepancy of 6% in the analogue value may be reduced by 
the choice of a smaller interval 4; thus, taking h = 6/4 leads 
to a value of 0-490A2,/06 for the loss per unit surface area. 

‘The total magnetic flux © per unit z-width in the conductor is 
obtained from eqn. (22), i.e. 

Ne Lipa E(t 1) 
K BNR oe 
Ea ( jJOCR <UL; 
jwCK Dy) Io 
ie. ® = 0-709f,o/wCK 


ho So 
But C= x Ae 
BSS 
2 wos? 

= 0-709 ,98 
The analytical value is 0-707 4Ay09. 


herefore ® = 0-709H,o 


(6.2) Single Conductor Filling Rectangular Slot 


The case of a single conductor, height 26, filling a rectang 
slot reduces to a 1-dimensional problem if it is assumed that 
magnetic flux paths in the slot are straight lines normal t< 
sides. Fig. 6 represents a section of the conductor of width 


R/2 ae 


Fig. 6.—Network analogue for a single conductor filling a rectang 
slot, ignoring field-fringing. 


and height 26, ie. h = 6/2. Since the magnetic field stren 
H,, at the root of the slot is zero, the equivalent networ 
open-circuited at its lower end. Consequently the total curt 
I, through the capacitors, which must be equal to the cur 
through the corresponding section of the conductor, enter 
terminal A and leaves at B. To determine the effective 
resistance of the conductor it is necessary to find the divisio: 
this total current between the several capacitors, as pointed 
in Section 5.1. 
Circuit analysis shows that 


(3/72 = \1 + 70-5]? = 1-25 
(3/P2 = |0-75 4+ j1-5|? = 2-81 


13/f? = |— 0-25 + j2-875|? = 8-33 
r=4 
Therefore [2 = 13-39f? 
r=] 
Also f/f? = |2-5 — j4-875|? = 30-02 
Sp 
4 
Rac r=1 i 
Hence — = Aaa 
Ree pn = 1-78 


Solution of this problem analytically without the finite differ: 
approximation yields a value of 1-90. 


(6.3) Loss in a Particular Conductor in a Rectangular SI 


This is the familiar problem of a number of identical « 
ductors of rectangular cross-section, each carrying a curre! 
the conductors lying directly one above the other in a rectang 
slot. The ladder network of Fig. 7 is the analogue representa 
of the nth conductor from the root, the conductor height bi 
taken as 6 for this particular calculation and the inte 
h = 6/4. The analogue therefore represents a section of 
conductor, 5/4 wide by 6 high, and the total capacitor curre 
is a corresponding fraction 6/(4 x slot width) of the condu 
current i. The ratio of the magnetic field strengths at the uy 
and lower edges of the conductor is m/(n — 1), since this is 


R/2 


D 
fin=0z 


ry 


ig. 7.—Network analogue for the nth conductor from the bottom 


; of 5 slot carrying a number of rectangular conductors stacked 
in line. 


tio of currents flowing in the slot conductors below these 
vels. Consequently, this must also be the ratio of the current 
ntering at terminal A to that leaving at terminal D. This leads 
) the terminal currents indicated in Fig. 7. 

ircuit analysis along the lines indicated in the previous Section 
ves Ryc/Rpc = 1:076 + 0:299(n* — n) as compared with 
uclRpc = 1-084 + 0-323(n? — n) by field analysis. 


(6.4) Partially Transposed Conductors 


CONDUCTORS IN CONDUCTORS IN 


FIRST SLOT SECOND SLOT } 2(1,+15) 


ote sete oe ee ree eo ee 


POSITION 
IN SLOT 


Fig. 8.—Analogue representation of four conductors partially 
transposed in a slot. 


through two slots between parallel connection points, the 
er of the conductors in the second slot being a simple reversal 
‘that in the first. In such a case conductors A and D will 
equal currents, i;, and conductors B and C will carry 
rents iz, The equivalent analogue has been drawn on the 
isis of a relatively coarse mesh, since it is the intention here 
ly to illustrate the setting up of the analogue. The chosen 
terval. A, is half the height of a conductor; hence each con- 
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ductor is represented by only two network sections and the 
analogue currents J, and J, are a fraction (/conductor width) of 
the actual conductor currents i, and i,. The magnitudes and 
phases of the applied voltages Vz, Vj, V5, V3 and Vz must be so 
adjusted that the indicated equalities in terminal currents are 
obtained. There is a further condition arising from the parallel 
connection. Ignoring the voltage drops in the end connections, 
for each slot the sum of the voltage drops per unit length in 
conductors A and D must equal the same sum for conductors B 
and C. In Section 4 it has been shown that in the analogue this 
must lead to 


Ve- Vi) + Ve- Vi) = (Ve — V3) + Ve — V3) 
ie. Viva Vs Vs 


Since voltage differences only are significant, V, may be set equal 
to zero without imposing any further condition on the analogue. 

Hence Vi = V3 + V3, if Vs =0 
The currents J;, J;, I4, [5 and Ig can be expressed in terms of J, 
using the above voltage relationship and the four mesh equations 
from the network. Such a coarse mesh does not justify eddy- 
current loss determinations for conductors of reasonable height, 
say 6, but it is interesting to note that for this height the analogue 
indicates that the ratio of i, to 7, is rather less than 0-5. 

To obtain significant loss data a much smaller interval would 
be required. Although it would still be possible to calculate the 
current distribution, it would be lengthy and tedious. This is 
one of the problems which will be solved by a practical analogue. 


(7) PRACTICAL CONSIDERATIONS 


To avoid undue complication in drawing the analogy between 
the network and the electromagnetic field, only one scaling factor 
K [eqn. (18)] has been introduced. K enables R and C in the 
analogue to be so chosen that the impedance level is satisfactory, 
and by assigning a dimension (m/s) to K the correspondence is 
made dimensionally correct. There is no reason, however, why 
other purely numerical scaling factors should not be introduced 
to permit greater convenience in operation. Two instances will 
be mentioned in which further scaling is either advantageous or 
possibly even necessary. In the first case, by operating the net- 
work analogue at a frequency higher than that of the field 
problem, the CR product in the analogue is correspondingly 
reduced; this could well lead to cheaper capacitors of greater 
stability. Secondly, in large machines the total current carried 
by the conductors in a single slot is frequently several thousand 
amperes; clearly a large scaling factor would be necessary here, 
since the foregoing analysis has taken a 1 : 1 correspondence 
between conductor and analogue capacitor currents. It should 
be noted, however, that the expression for eddy-current losses 
in the form R4c/Rpc [eqn. (23)] does not depend on the magni- 
tude of the total current but only on its distribution between the 
several capacitors. Since the analogue contains only linear 
elements the total current may be chosen arbitrarily in that case. 

No consideration has been given here to such general analogue 
problems as the adjustment of mesh values to suit curved 
boundaries or a geometry involving non-integral spacing, since 
these features have been treated in the published work of 
Liebmann*® and others. 
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DISCUSSION ON THE 


Mr. R. S. Mamak (communicated): The equations of eddy- 
current losses were first formulated by Field, and solved for the 
case of single-dimensional current and field. However, it can 
be fairly simply shown by the following example that the assump- 
tion that the field goes straight across the slot is contrary to the 


Z 


; 
SIDES OF 
SLOT 


CURRENT-CARRYING 
CONDUCTOR 
es 


Fig. A 


laws of electromagnetism. Along the path ABCD, no current 


H.dl=0. With the assumption 
‘ABCD 


is enclosed; consequently 


that only H,, exists, this means that Haz = —Hpc. It follows 

that ® H.dl =O, but this is impossible, since it encloses finite 
‘AEFD 

current. 


However, the results as obtained by Field’s analysis were 
found to be in close agreement with practical results, and his 
assumption was consequently accepted. What is not so well 
known is that Steidinger“ did attempt to obtain a more rigorous 
analytical solution by solving the two-dimensional form of the 
diffusion equation [eqn. (6) of the paper]. In practical cases 
this meant that Field’s « has to be multiplied by another constant 
B, i.e. Racl Rac = «8. The practical implications of Steidinger’s 
modification have already been fully discussed by Vallarta.® 

As Steidinger’s analysis shows, even for rather simple boundary 
conditions the analytical solution of the diffusion equation 
becomes wellnigh impossible: quantitative analysis has to be 
resorted to, where the main techniques are either by analogy or 
numerical methods, and obviously all the usual digital versus 
analogue-computer considerations still apply. 

A powerful numerical technique for solving differential 
equations is the replacement of the continuous differential 
equation by a number of ‘finite difference’ equations, of the form 
of eqn. (9) of the paper. These equations may be collected and 
expressed in matrix form. It will then be seen that, essentially, 
it is necessary to invert a complex matrix. Mr. Roberts has 
inverted this matrix by using an ingenious RC network. How- 
ever, for those not possessing an analogue computer, various 
powerful numerical methods of inverting matrices are available.© 
Finite-difference equations may also be obtained for all the 
boundary conditions discussed in this paper—in fact the finite- 
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difference equations for a problem very similar to that illustr 
in Fig. 4 have been formulated, and it is hoped to publish t 
in the near future. \ 

Finally, in specifying the boundary conditions for the | 
ductors in a slot [Figs. 4(a) and (5), lines ABCD and A’B’C 
it is not clear what assumptions are being made with respet 
the current-carrying conductors in the slots on either side o! 
one shown. If the currents in the neighbouring slots are 
equal and opposite to those in the slot under consideration, 
condition that the lines of force follow the pattern illustrate 
Fig. 4 is incompatible with the statement that the iron is infin 
permeable. Consequently if the effect of the conductors in 
other slots is to be considered, the inherent antisymme 
existing in most machines should be utilized, though, of cot 
this would entail taking a batch of slots. 
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Mr. J. Roberts (in reply): Mr. Mamak draws attention to 
shortcomings of the one-dimensional analysis used by Field 
the conductor in an armature slot, but it is well to note tha’ 
pointed out in Section 1, this is a justifiable assumption 
rectangular conductors which substantially fill the width of 
slot and do not approach closely its open end. This condi 
is met in many practical designs. 

Fig. 4 shows a single isolated slot with the approxir 
magnetic flux pattern. The flux lines AB and CD, whose dist 
from the mouth of the slot is equal to the length of the air 
will be very nearly straight. Considering now the effec 
neighbouring slots carrying conductors, additional flux wil 
superimposed on that shown in Fig. 4(a). The only portio: 
this flux which will change the current distribution in the « 
ductors shown will be that which penetrates these conduct 
The effect will be small if the air-gap is small, since then 
additional flux will fall off rapidly with depth in the slot, an 
only likely to produce current redistribution in a top condu 
which closely approaches the mouth of the slot. To take acce¢ 
of this effect on a single slot representation would involve 
application of suitably different potentials to the lines A’B’ 
C’D’ in the analogue. The current leaving the network via 
capacitors would not then be supplied equally from the two 
works representing the adjacent air-gaps. 
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URFACE-INTEGRAL METHODS OF CALCULATING FORCES ON MAGNETIZED 
IRON PARTS 
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SUMMARY 


ie force exerted on an iron part in a magnetic field may be calcu- 
| by a number of different methods, all of which give equivalent 
: distributions confined principally to a surface. Seven different 
iods, giving rise to five different surface force intensities, are 
m to result in the same total force. Additional volume force 
ponents are obtained and a new expression for the actual distribu- 
of force, consistent with all of the methods of calculation, is 
ed. A common interpretation of the well-known Maxwell field 
ses in iron is shown to be incorrect. When the field is specified 
erically the calculation of force in terms of surface integrals has 
in advantages over the virtual-work method, and it is shown that 
le analytic expressions for force which are usually obtained in this 
may be easily derived as surface integrals. Practical applications 
onsidered. 


LIST OF SYMBOLS 


B = Magnetic flux density in vacuo. 
B’ = Magnetic flux density in iron. 
H = Magnetic field strength in vacuo. 
H’ = Magnetic field strength in iron. 
J = Conduction current density. 
M = B — oH = Polarization. 
No = Primary magnetic constant. 
je = Permeability. 
p = Magnetic pole density. 
Suffixes 
s = Surface density. 
n = Component normal to surface. 
t = Component tangential to surface. 


(1) INTRODUCTION 


n important phenomenon underlying the operation of most 
tromagnetic apparatus is the mechanical force which is 
‘ted on an iron part in a magnetic field. Such forces are 
ally explained and calculated in terms of the virtual-work 
ciple,!» 2» 3 which provides a powerful method of analysis 
ling to simple expressions in terms of rates of change of 
actance when saturation and hysteresis may be ignored. 
en this is not the case, and when the magnetizing windings 
distributed, the virtual-work method is less simple, and it 
is to produce inaccurate results when the small differences in 
| energy produced by the virtual displacement have to be 
luated numerically. It is also open to objection in principle, 
€ it suggests that, when the magnetization is non-linear, the 
e is essentially dependent on the changes produced by a dis- 
sement of the iron part concerned, rather than on the field 
figuration associated with its specified position. 

in alternative method of approach is to express the mechani- 
force in terms of an integral over a surface (usually the iron 
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surface) supplemented in some instances by an additional 
volume integral. Perhaps the best-known example is the 
calculation of the force on the equivalent magnetic poles in the 
iron surface, but other methods (including the energy principle) 
may be used, as described in Section 2. Most of these methods 
are well known, but their practical application appears to have 
received little attention. One possible reason for this is that, in 
many problems in which the virtual-work method leads to a 
very simple calculation when the field configuration is suitably 
idealized, the way in which the same result may be obtained by 
surface integration is less obvious. An example is the force 
tending to pull two poles, or teeth, into alignment (Section 5.1). 
In addition, the simplicity of the surface-integration principle 
tends to be obscured by the fact that the different methods of 
calculating force in this way lead to several different expressions 
for the surface force intensity. The equivalence of these expres- 
sions is by no means obvious, and is confused by the fact that 
most, if not all, have been interpreted by different writers as 
representing the actual force distribution within the iron. The 
differences have led to a number of discussions of apparent 
anomalies (e.g. References 4-7), and in other instances the 
different methods have been incorrectly applied. 

The object of the paper is to examine the different surface- 
integral expressions for force on iron parts and to investigate 
their equivalence and practical application. The methods are 
compared and assessed only in so far as they offer means of 
calculating total forces; a review of the mechanism by which 
the force is exerted and of the actual force distribution within 
the iron requires too lengthy a treatment to be included. The 
discussion relates specifically to iron parts, since forces on ferro- 
magnetic materials are of the greatest practical importance, but 
all of the methods, and the equations deduced, may be applied 
by analogy to the forces acting on dielectrics in electric fields. 

The M.K.S. rationalized system of units is employed. In one 
respect, namely the substitution of 79 for jo and w for p,, the 
symbolism is that suggested by Cullwick and Carter.?° This 
change is designed to avoid the unfortunate confusion between 
the properties of polarized materials and the electromagnetic 
relations in vacuo which has become associated with the M.K.S. 
system. 


(2) EXPRESSIONS FOR SURFACE FORCE 


(2.1) Representation of Magnetized Iron by Pole and Current 
Distributions 


The most direct method of calculating forces on magnetized 
iron is to integrate the average forces on the dipoles which are 
the source of magnetization. This results, however, in a volume* 
as well as a surface distribution of force®» 9 (whether or not the 
permeability is constant or the iron carries conduction currents), 
and it is therefore of no practical importance as a means of 
calculating total force. Excluding also the direct application of 


* When curl H = 0 the method gives the same force distribution as the third 
Maxwell stress system discussed in Section 2.5. 


[19 ] 
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the virtual-work method, there are at least seven other different 
ways in which expressions for force can be obtained. 

First, any distribution of poles or currents, or both, which, 
when put in place of a piece of magnetized iron, reproduces the 
magnetic-field conditions at all points outside the iron, must 
experience the same total mechanical force as the part which it 
replaces. A simple example is provided by the method of 
images, in which the force acting on a large iron plate due to any 
system of magnetic sources is the force which would act on the 
image sources. A more general form of equivalent source” !° 
is a volume distribution of poles of density 


p=—divM (1a) 
combined with a surface distribution of density 
ps =, (16) 


This is a complete equivalent source provided that the iron 
carries no conduction currents and contains no ‘free poles’. An 
alternative equivalent source!® subject to the same restriction 
is a volume distribution of currents of density 


Af = y curl M (2a) 
70 
combined with a surface density 
1 
J, =—M, (2b) 
70 


where the direction of J, is at right angles to M,. 

To illustrate the significance of these equations consider a 
cylinder of iron magnetized uniformly in a direction parallel 
to its axis. Eqn. (1d) states that the polarization is equivalent 
to a distribution of equal but opposite magnetic polarities over 
the two end faces, and eqn. (2b) shows that the iron may also 
be replaced by a uniformly wound thin coil forming a current 
sheet coincident with the curved surface. If the polarization 
is not uniform and the permeability is not constant the surface 
distribution must be supplemented in each case by a volume 
distribution inside the cylinder. Both these sources are equiva- 
lent in that, when put in place of the iron, they reproduce the 
magnetic field conditions at all points outside the iron surface. 
Inside this surface (where there is now no iron) the fields cannot 
be the same; in the one case the magnetic field strength but not 
the flux density is reproduced, whereas in the other the flux 
density in the model, but not the field intensity, corresponds to 
that in the iron. The total mechanical force (if any) acting on 
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the same as that which acts on the iron. 
In general, an infinite number of equivalent sources may | 
devised in which both poles and currents are combined. 


but including no other sources, so that the field is everywhe) 
zero inside the surface chosen.!! The equivalent source th} 
defined consists of a surface distribution of poles of density 


Ps = NoH;, . . . ° . . q 


where H,, is the normal component of the field on the outsi) 
of the surface, combined with a surface current of density 


Seiad J, = H, 


directed at right angles to the tangential component, H,, of t} 
external field. This ‘magnetic model’ of the iron is not subj 
to the restrictions of the previous two but is a complete equiy, 
lent of all the field sources enclosed within the surface. | 
particular, it is immaterial whether the permeability of the irq 
is constant, single-valued or otherwise; in all cases the pol} 
and currents are confined entirely to a surface. | 


(2.2) Forces on Equivalent Pole and Current Distributions | 


The surface tractions associated with each of the three equiv) 
lent sources defined by eqns. (1)—-(3) may be expressed in tert} 
of the magnetic field intensity at the surface; the value i 
ately outside the surface is chosen, since this is the same for ead 
of the three ‘magnetic models’. The field vectors on the tyj 
sides of the iron surface are related by the continuity condition] 


B, = B, pHi=H, . . Je 
Hj =H, B, = 1B, ann 


The surface pole intensity in the equivalent pole distributid 
defined by egn. (1) is therefore [from eqn. (15)] 


ps = ol — IH, = to(1 — “) H, “| 


and the tangential component of the surface traction is given |j 


F, = psH; 


from which the expression given in Table 1, line 1 is obtai e 
The normal component of the field is discontinuous, having] 


= 


Table 1 


Method * Assumptions 


1 Surface poles y constant, surface coincident 


with iron surface 


Surface force components* 


Volume forcet 


Tangential, F; 


1 
no(1 = 5) Hal 


Normal, Fy i 


1 1 | 
¥no(1 a rab rile + no(J X H) 


2 | Surface currents — nl — DAnH; $no(u2 — 1)H? Mp’ + yours x x 
3 | Combined surface poles and None Ho HH 4n(H,? — H?) 0 
surface currents 
_= 

4 | Virtual work As 1 and 2 0 bno(1 — =) + uH?)| pH’ + oud x HY 


* Field components are those on outside of surface (in vacuo). 


t+ Due to ‘free polarity’ p and conduction current J. 


Normal (”) direction positive outwards. 
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lue H, outside and H,,/, inside, so that (by subtraction of that 
irt of the field which is due to the surface poles themselves) the 
jrmal component of force is 


| Jamas yeaah ee) 


om which the expression given in the 4th column of Table 1 
‘derived. 

The same reasoning may be applied to obtain the components 
' force on the surface currents given by eqn. (2b), except that 
sre the tangential component of the field is discontinuous. 

In both cases the surface traction gives the entire force on 
le iron when there are no conduction currents in the iron and 
hen the permeability is uniform [so that the volume distribu- 
ons given by eqns. (1a) and (26) are zero]. The volume force 
ymponents which are due to non-uniform permeability may be 
tadily derived, but they have no practical utility. The force 
ymponents due to conduction currents in the iron are given in 
ection 3. 

The third equivalent source consists of only a surface distribu- 
on, as given by eqn. (3), and the force associated with this 
lves the whole of the force on the iron in all circumstances. 
he calculation of this surface traction is complicated by the 
ict that both tangential and normal components of the field 
re now discontinuous. This difficulty is most easily circum- 
ented by supposing that the magnetic poles and currents given 
y eqns. (3a) and (3d), respectively, lie on two different surfaces 
sparated by a negligible distance. Only one component of the 
eld is then discontinuous at each surface, and the discontinuity 
spresents the contribution made by the respective source. 
ach of the surface forces may then be derived in exactly the 
ume way as before. The result obtained by adding them— 
hich is the same whether the poles or the currents are assumed 
) be on the inside or the outside—is given in line 3 of Table 1. 

‘The three surface tractions obtained in this way are entirely 
ifferent from each other. They will, however, give the same 
otal force (as is proved in Section 4), provided that the assump- 
ons on which the first two depend are valid. 


(2.3) Surface Force in Terms of Energy 


The energy principle can be applied in a well-known 
vanner!®: 12 to calculate not only total force but also an equiva- 
nt force distribution. Assuming that there are no conduction 
urrent or ‘free pole’ sources included within the iron, and that 
<ternal sources are constant, the rate of change or total potential 
nergy when an iron part is moved in any arbitrary direction, 


, is 
ow r) 
oe = 4m ||] are 


here the integral extends over the volume of the iron part only. 
Vhen the permeability, bs is uniform the force density corre- 
sonding to the integrand is zero everywhere except at the iron 
face. The surface force intensity may be calculated by 
ssuming that the permeability varies in a continuous manner 
ough the surface. The result, which is everywhere normal 
) the surface, is given in Table 1, line 4. 


(2.4) Maxwell Stresses in Vacuo 


The total force on any group of field sources may be found by 
rawing a closed surface round them and integrating the Maxwell 
sld stresses in vacuo over this surface.!3 The stresses consist 
fa tension along the lines of force of magnitude 4y)H? and 
1 equal pressure at right angles to them. Resolving in the 
ormal and tangential directions relative to an arbitrarily chosen 
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surface,!* the component of stress directed away from the 
surface is F, = 4yo(H? — H?) and the tangential component is 

= oH, H,. 

tet fics is the same as that on the combined surface pole- 
surface current equivalent source (Table 1, line 3), so that these 
two methods of calculating force are essentially the same. 
However, in the Maxwell stress method the above components 
represent the force transmitted through the surface to the sources 
inside, whereas in the equivalent-source method the force acts 
on the surface. Both methods give the force on an iron part 
irrespective of saturation or hysteresis, and both give zero 
resultant force if the surface encloses no field sources. 


(2.5) Second Maxwell Stress System 


Maxwell'? proposed a system of field stresses in the interior 
of dielectrics and these may be applied by analogy® 1!) 15 to 
magnetically polarizable media (whether or not the permeability 
is constant). The tensile component of stress is 449(H')? along 
the lines of force, again combined with a transverse pressure of 
the same magnitude. Thus the components resolved in the 
normal and tangential directions relative to any interior surface 
in the iron are 


F,, = 4-0,” 
Ff, = no, H; 


cre (H;)7] (Sa) 


(5d) 


where primes denote field values in the interior of the iron. 

When the permeability is uniform and the iron contains no 
conduction current or ‘free pole’ sources, then, by direct com- 
parison of the two stress systems, the integration of eqn. (5) 
over any closed surface within the iron gives zero resultant 
force, i.e. there is no volume force acting on the iron. It follows 
that the difference between the field stress on the inside of the 
iron surface and that on the outside gives the same result when 
integrated as does the exterior stress alone. Thus the force 
associated with the differential surface traction is equal to the 
force on the iron. The expression which is obtained for the 
surface force is the same as that given by the energy method 
(Table 1, line 4). 

The volume force associated with the interior field stress is 
no longer zero when the permeability is not uniform and when 
conduction currents are present. Under these circumstances 
eqn. (5) does not give a zero result when integrated over the 
inside of the iron surface, and hence the differential surface 
traction no longer gives the force on the iron. In effect, the 
component of force which is subtracted when the interior stress 
is taken into account must be subsequently added back again 
as a volume force (conduction currents are taken account of in 
this way in Section 3.4). Clearly, for the purposes of force 
calculation, there is no significant practical advantage to be 
gained by taking the interior field stress into consideration. It 
eliminates the tangential component of the surface force, but 
this is, in any case, entirely negligible for all normal values of 
permeability. 


(2.6) Third Maxwell Stress System 


Maxwell!3 deduced a further system of ‘interior’ field stresses 
specifically in terms of magnetically polarizable media. 
Assuming a constant scalar permeability, the stress consists of 
a tensile component along the field lines of magnitude 
Nol — 4)UT ‘ combined with a transverse pressure of magni- 
tude 479(H’)?. This resolves into components normal and tan- 
gential to a surface:!4 


F, = 4nolQu — 1)CH,)? 
F, = peo, A; 


ca (H;)"] : (6a) 


(6d) 
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This stress system is mentioned in the literature less often than 
the previous one, but where it is described it is customarily 
applied to the calculation of force in exactly the same way, i.e 
as a surface integration only (see, for example, Reference 16). 
In fact, the differential surface traction, which is 


IN 1 
F, = 4oH2(1-——) =5— MP. 
270 ( a 2n 
FG (7b) 


does not represent the total force on the iron.’»9»!! There is, in 
addition, a force on each volume element even if the permeability 
is constant and there are no conduction currents. Maxwell’s 
derivation is based specifically on a volume force whose x-com- 
ponent is 


(7a) 


oH, dH, oH; 
Bi Mar te atest ane 
in terms of the polarization, M, and the conduction current, J, 
‘free poles’ being excluded. , 
The calculation of total force from eqn. (6) may be clarified 
by applying it to an interior volume element of the iron.* 


— JLB, — (8) 


Aes f 


+ JyB: 


Fig. 1.—Volume element. 


Taking an element of side 5x, dy, 5z (Fig. 1), the force in the 
x-direction on side A is, from eqn. (6a), 


pee NG 
AF] dns & ‘8 Gs ies x) 


= (4; +5 + oF be) — (ez phe i 8x) "lave: 


The contribution to the force on the element, in the same direc- 
tion, due to face B is, from eqn. (6d), 


(OF.)zp = he (2%: ae 2B 5) ( ip a5) dxdy 


and the contributions from the remaining four faces are obtained 
in the same way. Adding the six components, the total force 


reduces to 
Olds, oH, wees 
dF, = No ce 1) Hy, 5 Hy; = | sxby8: 
oH, 0H; oH POTS 
fj bol HE: ey 7 eae eT + A ei) dxdydz (9) 


which gives a force per unit volume in accordance with eqn. (8) 
when curl H’ = J and div H’ = 0. 
* This is essentially a matter of differentiation of the stress tensor, but the ele- 


mentary calculation is given for clarity. It is important to distinguish this from the 
slight, but important, modification used in Section 4. 


Thus the volume element is not in equilibrium under the in 
ence of the field stress only. By superposition, the force on } 
interior part of the iron is given by integrating eqn. (6) over 
bounding surface, and, in particular, the total volume fo 
acting on the iron is given by the integral of this equation o| 
the inside of the iron surface. The component of force whic! 
attributed to the interface itself [eqn. (7)] is obtained by s' 
tracting this same integral from the integrated field stress on 
outside of the surface. To obtain the total force the volu 
component and the surface component must be added toget 
when the two interior stress integrations cancel out and leg 
the force associated with only the exterior field stress—which) 
the force on the iron. ( 

The third Maxwell stress method is therefore useless ag 
practical means of calculating total force. The. surface fo 
given by eqn. (7) is not the force on the iron, and when } 
component which must be added to it is evaluated as a surfe 
integral the result is identical with the surface force obtain 
when the interior stress is ignored. 


(2.7) Other Field Stresses 


As Heaviside,!! O’Rahilly!5 and others have pointed o! 
there are many possible field stresses applicable to the intei 
of polarized media. Each different stress will be associat 
with a different volume force, but the total force acting on thei 
will, inevitably, be given correctly provided only that the stré 
reduces to the proper value in vacuo (which is unique except 
a constant). 

For the purposes of practical calculation of total force, the 
‘interior’ stresses are of interest only when they are associat 
with a zero volume force. The only stress which satisfies t) 
condition is that described in Section 2.5, and even this giv 
the result as a surface integral only when there are no cc 
duction currents and the permeability of the iron is constant. 


(3) ADDITIONAL FORCES DUE TO CONDUCTION CURREN’ 
(3.1) Conduction Currents and Magnetic Poles 


Three of the surface expressions given in Table 1 depend | 
the assumption that the iron contains no field sources oth 
than the polarization dipoles, i.e. they will in general give or 
that component of the total force which acts on these dipol 
The iron will experience additional forces if, besides bei 
magnetized, it conducts electric current or if it has any ‘fre 
poles’ immersed in it, ic. any magnetic polarity not account 
for by the polarization dipoles. The conduction currents foi 
a possible source of this polarity in that they may be replac 
by their ‘magnetic-shell’ equivalents, and a distributed polari 
as well as a distributed conduction current will therefore 
assumed to be present for the sake of generality. This is 
obvious importance in the analogous electrostatic problem. 

In practice, any currents which have a significant effect | 
the mechanical force will normally be confined to copper co 
ductors placed in tunnels or slots, in which case the currents m 
be properly regarded as outside the iron. However, it may 
convenient to lump together the forces acting on the copper al 
on the iron by ignoring the slot as such, i.e. the slot surfa 
In addition, the iron itself may occasionally carry large co 
duction currents. 

In one case only, namely line 3 of Table 1, the surface for 
expression is sufficient. Each of the remaining three expressio 
is obtained by subtracting from this result different force co! 
ponents (obtained by integrating over the interior of the im 
surface) which are associated with the additional field soure 
and these components have to be evaluated as volume integr 
and added back again to obtain the total force. 


(3.2) Volume Force in Surface-Pole Model 


| The required volume forces may be calculated by supposing 
jat the current and free polarity which is distributed over each 
| lit volume of the iron is concentrated in a cavity of negligible 
| fe. In the surface-pole equivalent model of the iron it is 
|mvenient to choose the shape of the cavity so that the total 
|rface polarity on its walls is negligible when it contains no free 
larity; this implies a shape defined by two parallel surfaces a 
lgligible distance apart and oriented so that they are also 
|rallel to both the field and the current vector. Under these 
\nditions there is no resultant force on the cavity walls when the 
irrent is placed midway between them, and the additional 
tee due to the current which is not accounted for by the inte- 
‘ation over the outer surface of the iron is simply the force on 
e Gmaginary) current-carrying conductor, i.e. 


F = 1oJ x H’ 


(10a) 


‘r unit volume, where J is the conduction current density 
id the cavity field is related to the field H’ in the iron by 
jn. (45). 

\If the iron contains free polarity of density p, then, when 
lis is placed in the cavity, the additional force per unit volume 
similarly 

| F = pH’ 


lowever, since div H’ = p/n there is induced polarity on the 
tity walls of the opposite sign and of magnitude 


(1 — 1/p)p 
tht the total force acting, in addition to that accounted for 
y the integration over the exterior surface, is 


Pe (105) 
102 


(3.3) Volume Force in Surface-Current Model 


It is most convenient to choose the cavity so that, when it 
ontains no conduction current, the total surface current in the 
alls is negligible. The cavity will therefore be supposed to 
ave the same shape as the previous one, but to be oriented so 
lat the field vector is perpendicular instead of parallel to it. 
Vhen a magnetic pole is introduced it induces no resultant 
irface current in the cavity walls, so that the additional force is 
ie force on the pole only. From eqn. (4a), this is 


F = ppH’ (11a) 
When a conduction current is present only that component 
hich is normal to the field can be replaced by a concentrated 
imrent within the cavity; however, it is only this component 
hich gives rise to an additional force. The force on the 
Juivalent conductor is F = uyoJ x A’. In addition, there 
re currents having the same sense induced in the cavity walls 
nd these will add to the force. Since curl B’ = pyoJ the 
lagnitude of the additional current is (4% — 1)J, and the total 
ice additional to that resulting from the integration over the 
‘terior surface is 

F = pnoJ Xx H’ (11d) 


(3.4) Volume Force in Energy Model 


Neither eqn. (10a) nor eqn. (118) gives actual forces on con- 
action currents in theiron. They represent essentially arbitrary 
»mponents of total force which are subtracted from the field 
ress in vacuo to obtain the surface traction. Similarly the 
parent volume force which corresponds to the surface traction 


; 
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given on line 4 of Table 1 is that which is associated with the 
second Maxwell field stress on the interior of the iron surface. By 
comparison with the stress in vacuo, the volume intensity of this 
internal force is seen to be 

F=pH’+Jx B (12) 
and when this is not zero it must be integrated and added to the 
surface force. 

The same result may be obtained using the energy method by 
considering the change in energy when the currents and poles 
are moved through the medium. This means that eqn. (12) 
gives the total mechanical force acting on field sources immersed 
in liquid or gaseous media. This is not, however, a purely 
magnetic force, but consists of the sum of a magnetic force and 
a hydrostatic pressure exerted by the surrounding fluid (e.g. 
Reference 12, p. 103). The ratio of these two components 
depends on the shape of the conductor in which the current 
flows, in the same way as the force on a conductor in a solid 
medium depends on the shape of the cavity made to contain it, 
and eqn. (12) gives no more insight than eqn. (10) or eqn. (11) 
into the actual magnetic forces associated with the conduction 
charges. 

The volume force expressions are summarized in Table 1. 


(4) EQUIVALENCE OF FORCE EXPRESSIONS 
(4.1) Method of Verifying Equivalence 


The four expressions for force listed in Table 1 give entirely 
different force distributions both over the surface and within 
the volume of the iron, but all should integrate to give the same 
total force in all applications where the permeability is constant. 
The equivalence may be verified directly by calculating, by each 
of the methods, the force on an arbitarily chosen volume element 
of iron. 

When the iron carries no conduction currents and its per- 
meability is uniform, the apparent volume force is zero in all of 
the methods considered except the third Maxwell stress method 
(whose results are not included in Table 1). But if a volume 
element is isolated from the surrounding iron by cutting a 
narrow cavity, one of whose walls forms the surface of the 
element, a surface distribution of force must be taken into 
consideration. The element becomes an independent part on 
which the force acting can be measured experimentally and 
determined theoretically by each of the methods described. The 
calculation is relatively simple, provided that the width of the 
cavity is small, so that a negligible volume of material is removed 
and the field conditions are not appreciably modified anywhere 
except in the immediate vicinity of the cavity.* 


(4.2) Combined Surface-Pole/Surface-Current Equivalent 


The force expression given on line 3 of Table 1 is the most 
general. Choosing a Cartesian volume element, as in Fig. 1, 
and expressing the force in terms of field values in the cavity, the 
component of force in the x-direction on face A is 


(SF.)4 = 40 I@ te oy * x) =(#, 45 +— ee) 


= + eas) | by8e 


The component acting on face B is 


oH, 
(SFB = No (2 He ee 82) (A + “Esa, dxdy 


* This also implies that the calculated force is unaffected by the cavity, and is 
therefore the actual force in the volume element. The method therefore eppears to 
solve the long-standing problem!1,!3 of the actual force distribution. 


This is, how- 
ever, irrelevant in the present context and will be discussed elsewhere. 
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and expressions may be similarly written down for the forces 
on the four other faces. By adding the six components a general 
expression for volume force may be obtained, but since the 
result is required only for the purposes of comparison it is 
sufficient to assume constant permeability. Expressing the field 
components in the cavity in terms of the components in the iron 
[eqn. (4)] the force on the element in the x-direction becomes 


Sy ake OF tee os 
<Z 2 / x , y , Z 
ae ae (u He — H, — He ) Sxdy8z 
Ror To aoe 
+ pno (Hy se ran t Bi + Hs *) 8x8y8z 


Substituting curl H’ = J and div H’ = p/n, the x-component 
of the force per unit volume reduces to 
Ve Lae oH, ie 
Ox. | M, ox nae 
which is somewhat similar, but is not identical, to the expression 
deduced by Maxwell and used as the basis for his third stress 
system (Section 2.5) 

The torque acting on the element can be similarly calculated 
from the tangential components of force on four of the faces. 
The result contains no first-order terms, so that the torque per 
unit volume is zero for a sufficiently small volume element. 


F,=pM, t pH ESB rps) 


(4.3) Alternative Methods of Calculating Force on Element 


All the alternative methods of calculating force require the 
addition of a surface force and a force acting on the interior 
part of the element. The simplest is that given in line 4 of 
Table 1 (i.e. the virtual-work or second Maxwell stress method), 
in terms of which the force on face A is the x-direction is 


(6F4 = $o(1 a z) (4+? 73 tis ay): 
+ u(H, + sds) +» (H, + Hess) | Spe 


(the field values being those in the cavity) and the only other 
face which contributes is the opposite one. The force per unit 
volume acting on the entire surface is therefore 
oA, oH, oH; 
Fy = pM, + M, i ae 

in terms of field components in the iron. The appropriate 
volume force is given by eqn. (12). When the two components 
are added the total force per unit volume is in accordance with 
eqn. (13). The torque is clearly zero. 

The force expressions given on the first two lines of Table 1 
may be applied in exactly the same way, except that in both 
instances there will be components in the «x-direction acting on 
all six faces and in the interior of the element. By either method 
the resultant is in accordance with eqn. (13). 

All the methods considered are therefore equivalent for a 
small volume element, and it follows that they must be equivalent 
for an iron part consisting of any number of such elements. The 
surface force components will cancel out at mutual interfaces 
and leave a resultant surface distribution over the outer surface 
of the iron only, plus a volume force distribution given by 
eqn. (10), (11) or (12), respectively. 


(5) APPLICATION OF SURFACE-FORCE EXPRESSIONS 
(5.1) Choice of Surface of Integration 


The first Maxwell stress, or combined surface-pole and 
surface-current, method has obvious advantages over the other 


forms of surface integral listed in Table 1, and this willje 
almost invariably, the most suitable method for pracig) 
problems. One of its principal advantages does not appeal 
be sufficiently widely utilized, namely that the surface of inte 
tion need not be chosen to coincide with the surface of the 
part on which the force is required. Many examples of fix 
calculation which give simple analytic results when appropiji 
assumptions are made are customarily analysed by the en ny 
method, and yet the same result may be obtained as simplyjj 
surface integration, provided that the surface is suitably cho 

A simple example is the force reaction between two sz 
poles, or teeth, tending to pull them into alignment (Fig 
Assuming that the poles have infinite permeability and that tht 
is a uniform-field region between adjacent pole tips and i) 
ligible field intensity at the remote tips, the alignment force | 


= 4nH’g 


where H is the field strength in the uniform-field region and } 
the gap length. This result is obtained by the energy met, 
by Moullin* and Tustin.!7 One method of verifying it all 
surface integral is to choose a surface such as ABCD [Fig. 2/ 
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Fig. 2.—Force on overlapping poles. 


(a) Usual integration surface. 
(6) Modified integration surface. 


which coincides with the pole faces on three sides, and is clos 
by supposing an indefinitely small air-gap within which 1 
plane AB may be drawn. This fictitious gap is necessa 
since the force given by eqn. (14) cannot be measured expe 
mentally without including the other iron parts to which 1 
pole is attached, or by making an actual air-gap. Under | 
assumptions specified there is no resultant component of si 
ways force contributed by the parts DC and CB of the surfa 
and there is no resultant force contribution due to AB, provic 
that the ‘fringe’ field of the main gap is negligible at the point 
The sideways force is therefore wholly accounted for by | 
face AD, and for this reason the force is customarily supposed 
act on the side of the pole (or tooth). 

When the surface is chosen in this way the sideways foreé 
difficult to calculate analytically (although not numerical 
But the difficulty disappears for other surfaces, both in this < 
in similar problems. If, for example, the plane AD is separa 


jm the pole surface by a distance sufficient to bring it outside 
} fringe field, then it no longer contributes to the force, which 
‘jw acts on the DC plane. The resultant integral, which can 
| evaluated by parts, gives a force in accordance with eqn. (14). 
)wever, it is more convenient to localize the force to one par- 
ular part of DC, and this can be done by drawing the surface 
dwn in Fig. 2(b). Here DC is interrupted at GE so that DG 
Jincides with the surface of the lower pole and EC with that 
} the upper one. DG and EC then contribute nothing to the 
‘fee, which is therefore given by the normal pressure on the 
tt GE; this can be evaluated by inspection and is in accordance 
th eqn. (14). To simplify the result further it is desirable to 
/parate BC from the pole surface as well as AD, so that B lies 
janegligible field. There is then no resultant force on AB due 
| |the fringe field of the lesser gap. 

|The surface-integration method has been found to be applic- 
le in an equally simple manner to every other example which 
is been examined and in which an analytic result is obtainable 
the virtual-work method. A further illustration of the choice 
||surface for a transverse-force, or torque, calculation is shown 
Fig. 3. The device illustrated, which is described by Tustin,!” 


chiceeeinaoen 


Fig. 3.—Torque motor. 


‘sists of a stator with two windings in slots at right angles to 
lich other and magnetizing a salient-pole rotor. The rotor is 
\bjected to a torque which may be expressed as a function of 
le field intensities at AA’ and BB’, provided that the fringe 
tlds set up at the various circumferential discontinuities do not 
\teract, and provided that the permeability is sufficiently large. 
he same result follows by inspection from the surface of inte- 
tation shown by the broken lines. 


(5.2) Numeric Integration of Surface Force 


|When simplifying assumptions such as those made in deriving 
yo. (14) are not justified, the energy method, as customarily 
dplied,-3 requires a numeric field solution before and after the 
irtual displacement. The displacement made must be large 
nough to produce a significant change in either flux or mag- 
8tomotive force. The change in field energy is then calculated; 
general this involves integrating the flux/current curve for each 
irn of the magnetizing winding, and then summing the results 
ir each of the field plots. When, on the other hand, the force is 
btained as a surface integral, only one field solution is required* 
nd no flux-current integration is necessary. Moreover, the 
umeric calculation may consist of only a minor modification 
fan analytic solution. For example, if the flux density on BC 
1 Fig. 2 is not negligible, the additional force component due to it 
lay be very simply and rapidly estimated and subtracted from 
gn. (14) once the actual field distribution is known. 

Tn practice, the information from which the force is obtained 


* This may be no direct advantage when calculating a force/displacement curve. 
lere is, however, an advantage in accuracy, since no differencing is necessary. 


: 
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will usually be given in the form of a field plot, as, for example, 
when the electrolytic tank is used. The surface integral must 
then be evaluated in terms of measured angles and distances, and 
the simplest finite-difference form to use is that suggested by 
Lehmann.!® Suppose one part of the surface chosen is an equi- 
potential such as PQ (Fig. 4), which might coincide with the 


Fig. 4.—Numerical evaluation of surface integral. 


surface of the iron when the permeability is large. Let the flux 
tube, RS, make an intercept Ar with the surface and represent a 
flux AD, From Table 1, line 3, the component of force on RS 
resolved in any arbitrary direction x at angle 0 to the normal to 
RS, is 


1 ,A®y2 
AF, => 2 =) At Cos 6 
Hence the total force on PQ in this direction is 
1 (A®)? 
= aoe 15a 
270 Po Ay 38 
where Ay is drawn at right angles to the x-direction and 
At = Ay cos 6 (158) 


The flux increment, A®, is usually a constant and may be taken 
outside the summation, so that the field analysis reduces to a 
series of measurements of Ay. Exactly the same construction 
may be used if PQ is a flux line instead of any equipotential, and 
RS represents an increment AF,, in m.m.f. The surface force is 
again normal to PQ and has the same magnitude, but its direc- 
tion is reversed. Hence 


eee al 


(16) 


In problems in which an attempt is made to allow for satura- 
tion the magnetic equipotentials may have an appreciable 
inclination relative to the iron surface. Where this occurs along 
parts such as DG and EC in Fig. 2(b) of the surface of integra- 
tion, it is most convenient to modify the latter so as to follow a 
zigzag path along the field equipotentials and flux lines, as in 
Fig. 5. If the potential increment along PP’ is such as to 
divide the flux tubes into curvilinear squares, the force acting on 
the whole of PQ may be obtained by summing eqn. (15a) [or 


Fig. 5.—Integration surface near saturated iron. 
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eqn. (16)] from P to Q with constant A®,. The Ay-increments 
are interpreted (with appropriate signs) in the same way for both 
PP’ and P’Q. 


(5.3) Torque on a Slotted Rotor 


In the combined surface-pole and surface-current equivalent 
model the current density is given by eqn. (3b), i.e. J, = Hi. 
Thus the line integral of H, round any closed contour drawn in 
the surface gives the total surface current linking this contour. 
But, by Ampére’s law, the line integral of the field intensity is 
determined by the conduction current flowing in the iron, from 
which it follows that the equivalent current, J,, corresponds 
to the actual current in the iron, but distributed over the surface 
in accordance with the above equation. 

It follows that the torque on an ordinary slotted machine rotor 
may be expressed in terms of the torque which would act on the 
rotor conductors if they were placed in the air-gap. The rotor is 
magnetically equivalent to a distribution of poles and currents 
over a cylindrical surface passing through the tops of the teeth, 
the intensity of these sources being defined by eqn. (3). When 
the permeability of the iron is infinite, each slot pitch in this 
surface contains a total current equal in magnitude to the corre- 
sponding slot current. The torque on the rotor is the same as 
that on this current, provided that the magnetic polarity is 
regarded. as placed on the inside of the cylindrical surface and 
the current on the outside. When the permeability is not 
infinite the surface current differs slightly from the slot current. 

The principle of equivalent surface distributions thus provides 
an exact basis for the orthodox treatment of slotted machines, 
in which the mean torque is shown to be the same as that which 
would act on the rotor currents when placed on the surface of an 
equivalent cylindrical rotor. The more exact model shows how 
these surface currents have to be distributed, in the actual air-gap 
field, so as to produce the same torque in all rotor positions as 
well as the same mean value. One result which follows from this 
is that the torque pulsations corresponding to the slot pitch do 
not disappear when the slots are closed, even if the permeability 
of the iron remains sufficiently large to be regarded as infinite. 
The effect of the slots is negligible only when they are sufficiently 
far from the surface for the tangential field component at the 
surface to be uniform over each slot pitch. 


(5.4) Forces acting on Toroidal Electromagnet 


Five different surface-force expressions are obtained when 
eqn. (7) is added to those listed in Table 1, and of these four 
differ by only very small amounts (i.e. by factors of 1/, or less) 
when the field at the surface of the iron has no tangential com- 
ponent. Because these differences are small, apparent anomalies 
arise if the attractive force tending to close an air-gap is assumed 
to act on the iron surfaces forming the two sides of the gap. 
Several writers*> >» 7,16, 19, 20,21 either analyse examples of such 
forces incorrectly or else point out apparent anomalies, the 
example generally treated being the split toroidal electromagnet 
(Fig. 6). It is therefore instructive to calculate the force tending 
to pull the two halves of:such a magnet together, using each of 
the expressions listed in Table 1 and taking account of all 
terms, whether small in magnitude or not. The calculations are 
simple if the core is assumed to be rectangular in section and if 
the air-gap is sufficiently short compared with its area for the 
fringe field to be negligible (i.e. the calculation refers to the 
limiting case’as the gap is closed). Since the flux density is 
uniform it is reasonable to assume constant permeability. 

The force is most easily obtained from the equivalent surface 
poles, since, for the toroidal configuration, the surface polarity 
is confined to the two sides of the air-gap. Hence, from Table 1 


Fig. 6.—Toroidal electromagnet. id 


a 


line 1, the restraint which has to be applied to the iron at e|| 
air-gap is, per unit cross-section, if 


Fy = 4o(1 — 25) He siete ¢ 


where H, is the field in the gap. If the mages windin} 
similarly. split into two halves these are also subjected td) 
force tending to bring them together. This force is very eai) 
calculated (by the Maxwell stress method) since it is unaffec) 
by the presence of the core; it is 


at each gap, per unit cross-section. Hence the combined fo 
on both the core and winding is 


F, + Fy = 4H? . . . . . (| 


in accordance with the Maxwell stress in the gap. 

In terms of the equivalent surface currents the force on | 
iron is confined to the cylindrical surface. The force on t 
equivalent coil may be written down by inspection by the sai 
method used to obtain eqn. (175), or it can be obtained | 
applying the appropriate surface-force expression in Table, 
taking moments about the end of the half-core and integrati 
to find the force applied at the other end. By both methods F 
force is in accordance with eqn. (17a). 4 

In terms of the combined surface-pole and surfaceceli 
equivalent the force given by eqn. (17c) acts on the end fa 
of the iron. This is not, however, the force acting on t 
half-core, as there is an additional contribution from the 
curved surfaces. The resultant of these is outwards, since t 
field strength is greater on the inside surface, and its value 
given by eqn. (17b), as can be shown by taking moments 4i 
integrating, or by inspection from the transverse Maxwell str¢ 
or other considerations. The total force on the iron is the 
fore in accordance with eqn. (17a). 

According to the virtual-work method, the force which a 
on the end-face is 


¥o(1 $7 =) He 


Again this must be supplemented by a radial force on the cury 
surfaces, giving in this case an inwards resultant and therefc 
increasing the pull. The contribution which this makes at t 
gap may be calculated by taking moments, as before, and t 
same total force is obtained. The example also shows 0 
advantage to be gained by applying the energy method in t 
form of an equivalent surface force; when it is applied direc 
by considering the effect of a virtual displacement of the ha 
core?! the energy changes at the two curved surfaces are diffic 
to allow for and are easily overlooked. ~ 
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‘The problem is analysed by du Bois,!9 Howe? and Carter2° 

the third Maxwell stress system, and the force on the iron 
‘culated by applying eqn (7) to the end-face only. The result 
not the actual force on the iron, but does correspond to that 
mponent of it which is due to the other half-core, as distinct 
jm the effect of the magnetizing winding. However, the 
_ysical significance of this result is fortuitous and the difference 
| tween eqns. (7) and (17a) is explained entirely by the fact that 
= former does not represent the force on the iron. The 
‘axwell stress methods are, in principle, unable to distinguish 
‘tween components of force due to different external field 
urces. 


(5.5) Forces Acting on Straight Electromagnet 


In the toroidal electromagnet the force acting on each half- 
fe is accounted for by the force on the equivalent poles in the 
gap surfaces. In general this will be true only when there is 
| component of flux density normal to the iron surface at any 
her point in the system, and it is of interest to examine the 
yplication of the different surface-force expressions to an 
ectromagnet in which this condition is not satisfied. A 
‘ple example consists of an infinitely long iron rod cut into 
fo by an air-gap which is again supposed to have a negligible 
ngth/area ratio (Fig. 7). The rod is magnetized by a uniform 
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Fig. 7.—Straight electromagnet. 


inding whose length is finite, but which is sufficient to establish 
uniform field in the region of the air-gap. For this arrange- 
ent the force acting on each half-core is given most simply by 
ie integral of the surface force whose tangential component is 
TO, SO that there is no contribution from the cylindrical surface 
hat is by the virtual-work method). From Table 1, the force on 
¢ iron, per unit cross-section, is 


F=4no(1— 2) #3 


yhere H, is the field in the gap); this is slightly different from 
e force which acts on the core of the toroidal electromagnet 
qn. (17a)]. 

Tn terms of the surface-pole equivalent, the difference between 
Ins. (17a) and (18) is due to the force on the magnetic polarity 
stributed along the cylindrical surface. This, as shown in the 
ppendix, can be evaluated quite simply by integration if the 
on is assumed to have a small cross-section, and it gives an 
itwards component which, when subtracted from the force 
ting on the end-face, produces the total force given above. 
ne remaining two surface-force expressions in Table 1 may be 
nilarly integrated, with the same result. 


(18) 


p (6) CONCLUSIONS 

Surface-integral methods of calculating forces, although well 
wn in principle, have not been as widely applied in practice 
their merits appear to warrant. The force on an iron part 
n always be obtained from a single field plot by integrating 
er a surface, irrespective of saturation, hysteresis and the 
esence or absence of conduction currents. The method is 
rticularly simple to apply numerically; in addition, it gives 
alytic results as readily as the virtual-work method, provided 
at the surface is appropriately chosen. 


i 
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The three Maxwell stress methods and four others examined 
lead to five different surface tractions. These have been proved 
to be equivalent, provided that, in all but one, an appropriate 
volume force distribution is added to the surface force when the 
iron carries conduction currents. The magnitude of the volume 
force differs in the four cases. Only that force distribution 
which is associated entirely with the surface has any practical 
significance as a method of calculating total force. 

The system of field stresses proposed by Maxwell for the 
interior of iron parts is the only method which leads to volume 
distribution of force when there are no conduction currents 
present and the permeability is uniform. Because of this the 
surface traction associated with these field stresses does not, as is 
sometimes assumed, give the force on the iron. When the 
volume force is taken into account in a practical calculation the 
method reduces to the evaluation of the field stresses in vacuo 
only. 

It is probable that none of the equivalent distributions 
examined corresponds to the way in which the force is actually 
exerted on magnetized iron. An alternative expression relating 
to a measurable force density, consistent with all of the methods 
of analysis, has been derived. 
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(8) APPENDIX 


(8.1) Calculation of Force on Cylindrical Iron Core 


The force tending to pull the two halves of the core together, 
in the electromagnet shown in Fig. 7, may be written down by 
inspection from the virtual-work expression given on line 4 of 
Table 1. Eqn. (18) results. The three other surface-force 
expressions listed in the Table include tangential components 
which must be integrated along the cylindrical surface of the 
core, as discussed in Section 5.5. The details of the calculation 
of force on the surface pole distribution are given here to illus- 
trate the method; the expressions listed on lines 2 and 3 of 
Table 1 integrate in exactly the same way. 

Consider a short length 5x of the core, distance x from the 
end of the magnetizing winding. Let the field intensity on the 
axis at one end of the element be H’; then at the other it is 


H’+ _ dx. But div H’ is zero, so that the radial field com- 
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ponent at the curved surface, when the radius of this surface is 
small, is 
0H’. are 


Ox * Sar Ox &> 


po 
ox 
inside the iron. The force on the magnetic polarity in the 


curved surface is therefore, from Table 1, line 1, and from 
eqn. (4), 


H, = 3 


2mrdx 


No (1 = 4) Hyp 


Hence the total force associated with this polarity is 
H, 
0 


where H; = H,/y is the (uniform) field inside the coil. This 
force acts outward, so that it tends to separate the two half-cores. 
Thus the force 


DISCUSSION ON THE 


Mr. P. Hammond (communicated): Forces on iron parts are 
often very difficult to calculate, and the author has rendered a 
great service to designers, whose attention should be drawn 
particularly to the fact that the surface of integration need not 
be the same as the actual surface of the iron. Careful choice 
of the surface of integration may shorten the calculation con- 
siderably. This is well illustrated by Figs. 2 and 3 of the paper. 

The author stresses the advantages of using equivalent surface 
distributions which reduce the field within the iron to zero and 
thus abolish the volume forces. These equivalent distributions 
are those used by Green* in his analysis of the electric field, and 
are often called Green’s equivalent strata. 

While the mathematical part of the paper is admirable, I am 
less happy about the physical content. Apparently the author 
considers that none of the methods he lists give insight into the 
actual force distribution. All the methods are apparently 
merely mathematical devices and the choice between them is 
a matter of convenience. Without wishing to appear ungrateful 
I would suggest that few engineers will be satisfied to leave the 
matter there. The designer must know where the forces act on 
the actual parts of his machine. Equivalent layers and Maxwell 
stresses just will not do. Nor is this request for knowledge 
unreasonable. There must be a definite answer in every case, 
and one could use strain gauges to discover it. There ought, 
therefore, to be a mathematical method which is physically more 
correct than the others. 

This method, I suggest, is the method of surface and volume 
poles given by eqns. (1a) and (1b). It is, of course, true that 
dipole interactions are insufficient by themselves to account for 
ferromagnetic behaviour. Nevertheless on a large-scale view, 
which averages the effect of many magnetic domains, the mag- 
netic forces are correctly described by a pole distribution. From 
energy considerations it is the free magnetism at the ends of a 
domain that matters. It is surely there that the forces act. On 
this point of view the volume distribution merely becomes 
another surface distribution of poles. A divergence of mag- 


* GREEN, G.: ‘An Essay on the Application of Mathematical Analysis to the 
Theories of Electricity and Magnetism’ (Nottingham, 1828), Article 7. 
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which acts on the surface polarity at the air-gap (Table 1, lin 
must be reduced by an amount 


No 
Fy = ple — DEG 


and the total force inwards on the core, per unit area, is 


10 1 
F,-F,= 2 te 


im accordance with eqn. (18). 

The method of calculation shows, moreover, that the f 
is a function of the air-gap field only, and is independent of 
field distribution along the axis. Thus in this respect also 
result-is consistent with the previous one. 
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netization within the iron draws attention to a lump of sla 
the iron. On the surface of this lump there will be sur 
polarity. If we use good-quality material there will not be m 
divergence within the iron and so the forces will act on the 
surface. Although the method of surface polarity may 
always be the most convenient one in calculations, I w 
suggest that it is the method that gives the closest physical insi 
Mr. C. J. Carpenter (in reply): Mr. Hammond has rais« 
most interesting topic in his reference to force distribution, 
it is one which I should have liked to elaborate. Howeve 
requires a much more detailed treatment than was possibl 
the monograph, and I must confine my remarks to two po 
I do, as Mr. Hammond suggests, regard all the method 
analysis listed in Section 2 as mathematical devices, and I a 
that it is not satisfactory to leave the matter there. But 
paper does go further, and the force density expression der 
in Section 4 [eqn.(13)] does provide a possible answer. 
There are several reasons for supposing that the equiva 
poles and the forces associated with them have no phy: 
significance, not the least of them being that the pole i 
is a purely mathematical concept. The basic element « 
polarizable material is the dipole, and the forces which ac 
the dipoles are not the same as those which act on the equivz 
distribution of poles. For example, every dipole of the mat 
—whether viewed at a macroscopic or at a microscopic ley 
will experience a force unless it happens to be situated — 
uniform field. The equivalent pole distribution, on the c 
hand, is associated with only a surface force (assuming cons 
permeability elsewhere and no conduction currents). The 
face poles may be visualized as the ends of highly idealized 
hypothetical dipole chains, and in this view the forces whicl 
on the poles correspond to forces on elements which consi: 
half dipoles. Such an element is, of course, wholly fictition 
Mr. Hammond suggests that energy considerations sup 
the hypothesis of polar forces. However, the use of en 
considerations to obtain force distributions (as distinct from | 
forces) is open to serious objections. In any case, as is sh 
in Table 1, it gives a different result from the surface pole met 
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MICROWAVE TUBES—AN INTRODUCTORY REVIEW WITH BIBLIOGRAPHY 
By A. F. HARVEY, D.Phil., B.Sc.(Eng.), Member. 


(The paper was first received 24th April, 1958, and in revised form 10th April, 1959. It was published as an INSTITUTION MONOGRAPH in 
September, 1959.) 


SUMMARY m = Mass of electron = 9-1066 x 10~3!kg. 
The paper reviews the various types of electron vacuum tubes m, = Voltage gain of amplifier stage. 
iployed for amplification and generation at microwave frequencies. m,, = Resonant value of m,. 
nphasis is placed on principles of epee and on nee een n = Refractive index. 
veloped to give high power output, oscillations at the highest fre- Bi es es 3/2 
lencies and low noise factors. The treatment is restricted to con- Z az ees AONE Oia 


ntional tubes in which the output energy is derived from the d.c. 


put. The subject is interpreted in terms of published work, the P, = Anode dissipation, watts. 


xt being closely associated with a bibliography which is complete aoe ower given up by electrons, watts. 
|to the Microwave Valve Convention of May, 1958. Po = D.C. input power = IpV. 

After a general introduction, the first part discusses grid-controlled Pou: = R.F. output power, watts. 

bes. It is then shown how the interaction of space-charge waves Q = Space-charge parameter. 

th resonant cavities and slow-wave circuits results in the various Q; = Loaded Q-factor of circuit. 

rms of drift-space and growing-wave tubes. The second part deals O,, = Unloaded Q-factor of circuit. 


th crossed-field interaction in planar and circular geometry and 
sludes an examination of the magnetron. An account is given of 
vel methods of generation of submillimetre waves and the usual 
urces of electrons are described. The third part analyses noise 
enomena in oscillators and amplifiers. 


r = Radial co-ordinate, m. 

r, = Radius of anode, m. 

r, = Radius of cathode, m. 

s = Subscript denoting slow space-charge wave. 
s = Reduction factor for plasma frequency. 

S = Poynting vector, watts/m?. 


LIST OF SYMBOLS Mi es tie a ee rea sag 
: ; = Absolute temperature, deg K. 

Ay = Direct beam current density (sheet), A/m. 
9 y ( ), Al T, = Cathode temperature, deg K. 


B = Susceptance, mhos. T. 
B = Magnetic flux density, Wb/m2 ¥ 

_¢ = Speed of light in vacuo = 3 x 108 m/sec. 

C = Pierce’s gain parameter. 

C = Capacitance of circuit, farads. 

eg — Cathode-grid capacitance, farads. V = Volume, m3. 

l,. = Grid-cathode spacing, m V = Voltage, volts 
_e = Charge on electron = 1: 6027 x 10-19 coulomb. V ; : 


= Average anode voltage. 
E = Electric field, volts/m. Ve = Hartree threshold voltage. 


= Room temperature ~ 290° K. 

v” = Linear velocity, m/sec. 

Up = Average electron velocity, m/sec. 

Vp = Phase velocity along circuit, m/sec. 

v, = Peak value of r.f. electron velocity, m/sec. 


a a a Ea ac ne ‘ Vin = Peak value of r.f. input voltage. 
fy subscript denoting last space-charge wave. V,, = Peak value of kinetic voltage of beam. 
f = Frequency, c/s. 


x 5 Vo = Accelerating voltage of beam. 
J, = Bandwidth, c/s. Ves 


F = Signal/noise ratio. 

‘im = Mutual conductance, mhos. 
G = Conductance, mhos. X = Reactance, ohms. 

G, = Resonant value of G. ie Y = Admittance, mhos. 

A = Planck’s constant = 6:6252 x 107" joule-sec. Y, = R.F. admittance of electron beam, mhos. 
H = Magnetic field intensity, AT/m. z = Axial distance, m. 


I = Current, amp. Z = Impedance, ohms. 


= Peak value of r.f. output voltage. 
V, = Peak value of r.f. voltage across resonator gap. 
W = Energy, joules. 


fs i Direct beam current, amp. Z, = Coupling impedance of circuit, ohms. 
aa Peak value of r.f. beam current, amp. Z, = Characteristic impedance of beam, ohms. 
o—=4/ — 1. ; s Z;, = Impedance of load, ohms. 
J, = Cathode current density, amp/m*. Z, = Cold impedance of tube, ohms. 
J,, = Bessel function of the first kind and nth order. Z,, = Intrinsic impedance of free space = 377 ohms. 


J, = Average value of r.f. beam current density, A/m?. 


= Att tion coefficient. 
k = Boltzmann’s constant = 1-38042 x 10-73 joule/deg K. 5 Se EES 


B = Phase-change coefficient = 27/A. 


s ee aa a h B. = Phase-change coefficient of circuit wave. 
eee eranicey NEDTYS. 8. = Phase-change coefficient of space-charge wave. 

L, = Inductance of cathode lead, henrys. 8, = Gap coupling factor. 
This is an sategrating paper’. Members oe invited bp amit papers in this y = Propagation coefficient = « + jp. 
egory, giving the full perspective of the developments lea ing to the present path = " r : 
ictice i in a particular part of one of the branches of electrical science. Ye Propagat ion coefficient of space charge WAN’ 
spiTespondence on Monographs is invited for consideration with a view to e = Dielectric constant. 

ication. bi) wm pgel os a! 9 
Dr. Harvey is at the Royal Radar Establishment. €) = Permittivity of free space = (1/3677)10~? farad/m. 
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7 = Overall efficiency. 
Nq = Anode efficiency. 
Ne = Circuit efficiency = P,,,/P.. 
Ne = Electronic efficiency = P,/ Po. 

@ = Angular co-ordinate. 

6, = Drift angle of klystron. 

kK = Relativistic factor. 

= Free-space wavelength, m. 

A, = Guide wavelength, m. 

= = Plasma wavelength = 2v9/w,. 
Ho = Permeability of free space = 47 x 10~7 henry/m. 

p = Volume charge density, coulombs/m?. 

Po = Average value of p. 

w = Angular frequency, rad/sec. 
w, = Resonant value of w. 
w, = Effective plasma angular frequency, rad/sec. 
Rationalized M.K.S. units are used unless otherwise indicated. 


Part 1. GRID-CONTROLLED, SPACE-CHARGE 
WAVE AND TRAVELLING-WAVE TUBES 


(1) GENERAL PRINCIPLES 


(1.1) Operating Parameters 


Electron vacuum tubes for use at microwave frequencies 
include conventional as well as special types!38 peculiar to such 
frequencies. Amplification may be at high level in order, for 
example, to increase the power transmitted, or at low level to 
enable small signals to be detected, in which case low-noise 
characteristics are desirable. In either case adequate gain over 
the required frequency bandwidth is necessary. Such amplifiers 
can be made to generate power as self-oscillators by feedback 
from the output to the input circuit with the correct phase and 
amplitude. 

Grid-controlled tubes operating at microwave frequencies are 
subject to three main effects. The first is due to inter-electrode 
capacitance: if this is tuned with an inductance, the output is 
given by 

pinaes &mVin 
ou GIR) + 7(@C = Ted] 


(1) 
The gain is 
m;, = | Vowel Vin 


Bn (2) 


= [I + RC — IfwLy |? 
being 3dB down at frequencies such that 

ROC a CD) att Ween ees nS) 
Hence the frequency bandwidth is given by 


1 8m 
fs 27CR 27 Cm, pia heh pe 
The product m,, x f, is termed the figure of merit and may be 
applied to many types of tube. For the 6AK5, a typical minia- 
ture pentode, it has the value 4 x 108c/s. 

The second effect is due to the cathode-lead inductance and the 
cathode-grid capacitance combining to produce an input con- 
ductance, w7g,,LC, cg» Which, for the 6AK5 at 200 Mc/s, is about 
600 micromhos. This effect can be reduced by incorporating?*4 
a suitable inductance in the screen lead or by using separate 
cathode leads for input and output circuits. The third effect is 
due to electron transit-time which, under static conditions, is 
given by 


T 16°, fT) x0 Sgee ne tN ey (5) 


For a spacing of 0-05cm and cathode loading of 1 A/cm? the 
transit time is 2-5 x 10—!%sec, representing about one-quarter 


period at a frequency of 900 Mc/s. If the transit time bety 
cathode and grid is an appreciable fraction of the r.f. per 
power can be absorbed in the grid circuit even when no ele 
current flows. The input conductance due to this powe| 
proportional to g,,7?f*, and for the 6AK5 tube it is al} 
180 micromhos at 200 Mc/s. 
The frequency of oscillation of microwave generators is sult 
to sources of instability. The variation due to ambient temp 
ture of a copper resonant circuit at a frequency of 3Ge 
about 1Mc/s for a 20°C change. Since the frequency, 
governed by the ratio of electronic conductance to the effec 
capacitance of the resonant circuit, it varies with electr 
voltage—the so-called frequency pushing. The frequency ve 
tion due to the load can be studed by means of the equive 
circuit in Fig. 1(@). The resonant circuit associated with 
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(9 


OUTPUT 
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Fig. 1.—Equivalent circuit of a microwave generator. 


(a) Equivalent circuit. 
(6) Simplified equivalent circuit. 


tube is represented by L,, C and R, and the coupling to 
load by L; and M. This circuit is simplified in Fig. 1(6) 
assuming an ideal transformer of ratio 1 : L,/M with an ind 
tance L,(1 — M/L,L,) in series with its primary and an ind 
tance L, in parallel with its secondary winding. The tra 
formed load impedance is given by 


bea bss mM? 
Ziv (2) |Z, e joo, (1 8 = 


At frequencies near resonance the electronic admittance becon 


Yu Mile is #) 4 
Z, + jol,(1 -; a) +@) Gt 
where 6 = (w — w,)/w,. From eqn. (7), putting 
Khe joo (1 & Zh) 


2 
7 = (en 
L, Th, We + Xx, 1 Z; = p.% 1 

When the tube is not oscillating, Y, is zero and the propert 
of the resonant circuit and the coupling to the load may 
determined!* by measurement of Z,, the cold impedance. 

At microwave frequencies, the output power and frequency 
an electron-tube generator will be measured in terms of — 
amplitudes and phases of the reflection coefficient of the lo 
and a Rieke or circle-diagram presentation is often employ 
A typical diagram is shown in Fig. 2(a), from which it may 
visualized that a load of constant v.s.w.r. but variable phi 
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TOWARDS 
LOAD 


The variation in power, P, is shown in terms of maximum power and frequency deviation, Af, from the mean frequency, fo. 


(a) Load at the generator. 
(6) Load at 10 wavelengths from the generator. 


tuld cause a cyclic variation of frequency which is small near 
} upper part of the diagram but much more rapid near the 
ver part or ‘sink’. The effect of a long transmission line can 
included by rotating every point on the diagram through the 
propriate clockwise or counterclockwise angle. Fig. 2(b), for 
line of ten wavelengths, shows that the conditions near the 
ik are considerably modified. For some load reflection 
efficients no oscillations are possible, whilst at others, frequency 
mps or splitting may occur. Such long-line effects are unde- 
able and may be eliminated by the use of non-reciprocal 
Tite devices.®* 


(1.2) Grid-Controlled Tubes 


Grid-controlled tubes, usable at the lower microwave fre- 
encies, are usually triodes and tetrodes. The addition of a 
pressor grid to form a pentode has not been found advan- 
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with a bandwidth of 10Mc/s. In general, tetrodes are 
favoured!5° below 1 Gc/s and triodes at higher frequencies. 

The electrodes of grid-controlled tubes?33 may be cylindrical 
or planar? 243 and, in order to reduce the inductance, are often 
brought out of the vacuum envelope via disc seals. In the 
‘lighthouse’ triode 2C40, suitable?°* for operation up to 3 Ge/s, 
the electrode diameters are 0-18in, while the grid-cathode and 
grid-anode spacings are 0:004in and 0-012in respectively. 
Another tube?4° has a cathode area of 0:2cm?, cathode-grid 
distance (cold) of 20, a grid system composed of parallel 
tungsten wires, diameter 7, pitch 25 4 and an anode diameter 
of 4-5mm. This tube gave a gain of 17dB at 4Gc/s with a 
bandwidth of 110 Mc/s and provided, as an oscillator, 150mW 
at 6Gc/s. A triode for frequencies up to 10 Ge/s is adaptable! 
for use in either lumped-constant or distributed circuits. 

Grid-controlled tubes are usually associated with coaxial- 


ANODE 
CONNECTION 


CATHODE 
CYLINDER 


ay 


sy LO 


PLUNGER INSULATING DISC, 


Fig. 3.—2 Gc/s lighthouse triode with re-entrant cavity. 
The cylinders have spring contact fingers. 


zeous, since the r.f. output voltage is relatively low when the 
de is adjusted for optimum efficiency. High-frequency opera- 
m is facilitated. by the adoption of improved circuits such as 
tributed amplifiers.32 Triodes,> whether acting as low- or 
th-level amplifiers, are generally used in an earthed-grid 
cuit!: 161, 239, 337 in which feedback is only via the small 
ode-cathode capacitance. For example, a miniature 6AJ4 
ode gave,!49 in such a circuit, a gain of 6dB at 900 Mc/s 


line circuits.13- 14; 69, 324,325 Several arrangements have been 


developed!27: 26°, and, as an example, a re-entrant triode circuit! 
is shown in Fig. 3. Radial-line and waveguide cavities are 
employed at the higher frequencies, and in one design?®> an 
annular cavity was simultaneously excited by a number of tubes 
placed symmetrically about the axis. An output of 500 watts at 
1Gc/s was obtained using fourteen 2C39 triodes. A small 
planar triode possesses?!® 24° a transconductance of 0:05 mho, 
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giving a gain of 10dBat1Gc/s. Other low-level amplifiers have 
been based on pencil triodes!3¢ and other tubes.54 113; 250, 291 

The output power in the medium and high ranges®® !>> is 
limited primarily by dimensions, which in turn are related to 
frequency, by anode efficiency and by permissible power dissipa- 
tion at the anode. For the largest dimensions consistent with 
satisfactory operation the output is given by 


© 0-015f?P39, 
out ~— PZ Na 


where f is in Gc/s. For example, a tetrode for 0:9 Gc/s would 
deliver 124kW with a gain of 20 and 40% efficiency with broad- 
band operation. A triode for the same frequency, with 
50% efficiency, cathode loading 1A/cm*, anode dissipation 
500 watts/em*, would have a power output of 90kW with a 
bandwidth of 8 Mc/s. 

A beam power tetrode for frequencies near 500 Mc/s con- 
tained!8: 19 a centrally located cylindrical anode surrounded by 
an array of 40 units, each containing a control grid, screen grid 
and cathode. With such an assembly, an interfering TE,;, mode 
distribution of electric field could arise and give reduced output: 
this effect was overcome by balancing with external sectoral 
trimmer-plates. The class-B operating conditions were 8:25kV 
8 amp anode input, 900 watts grid input, giving a useful output 
of 27kW at 41% efficiency. A modified tube, with lengthened 
envelope, gave 100kW at an efficiency of 70% with an anode 
voltage of 16kV. Developments?» 82, 9: 183 of medium-power 
triodes have included an example tunable over the range 
3-8-4-2Gc/s with a gain of 4, an output power of 10 watts 
and a bandwidth of 100Mc/s. Triodes have been designed%?? 
for use as stable oscillators. 


18 (10) 


(2) BEAM AND CAVITY INTERACTION 
(2.1) Space-Charge Waves 


It was first shown by Tonks and Langmuir3*! that an electron 
plasma or beam can support space-charge waves of an electro- 
mechanical nature. Their energy exists in two forms: electro- 
static energy, which is capacitive in nature and associated with 
the bunching together of the electrons, and kinetic energy, which 
is inductive in nature and associated with differences in the 
velocities of the electrons. The theory of such waves 
shows!3!; 213, 287, 309 that both TE and TM waves can propagate 
but only the latter produce*° axial bunching of the electrons. 
If a space-charge wave of small amplitude is superimposed on 
an electron beam, then 


= Io SE ice 108 (11) 
v= % a Oem a msOt (12) 
where (jw — voy)? = epo/me (13) 


In the absence of external fields, the real part of y is small 
and thus J and v vary sinusoidally with time and distance along 
the beam. Possible values of y, obtained by solution of eqn. (13), 
are 


(14) 
(15) 


In a finite beam surrounded?*? by conductors, the plasma fre- 
quency is reduced by a factor s which has been calculated? for 
various geometries. 

Eqn. (14) represents two waves travelling with phase velocities 
Vo/(1 + w,/w), one slightly slower and the other slightly faster 
than the electron beam. The group velocities of the two waves 
are the same and equal to the beam velocity. 


y =i(@w + w,)/% 


where w2 = epo/me . 


The electron motion may be replaced, so far as ener, 
concerned, by the kinetic voltage 


Vi, = — mvrpfe 
The real power flow in the beam is given?!” by 
P=3T,Ve + IfV,) 
The ratio of V; to J, represents a characteristic impedance. § 
tmvs = eVy 
the impedances of the fast and s!ow waves become respecti 
Zep = Vegllig = + 2w,Volwly ; 
Zes = Visllyy = — 2wWyVolwlo 


The electron beam behaves?? like a transmission line and 
usual impedance transformation relations apply. 

Since Z,, is negative, the power flow*!? along the beat 
negative, the group velocity is positive and thus the st 
energy**>> 346 must be negative. This is because the ch 
density is greatest in regions of less than average velocity 
least in regions of higher velocity. The opposite condit 
obtain with the fast wave and thus the power flow and st 
energy are positive. Practical electron beams are axially s 
metric and confined by means of a longitudinal magnetic f 
The conditions for the propagation of space-charge waves 
then modified. !92, 29° 


(2.2) Amplifier Klystrons 


Interaction between an electron beam and a resonant cé 
forms the basis of the klystron amplifier. Much of the orig 
analysis of this tube followed a ballistic treatment? 1% 107, 35: 
but greater accuracy can be achieved with space-charge v 
theory. Although radial-beam tubes with annular resona 
have been described,”°® the klystron usually has the < 
arrangement shown in Fig. 4. The input cavity sets up fast 
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Fig. 4.—Typical klystron amplifier tube. 


The electron velocity entering the buncher is v9 = (2e¢Vp/m)1/2. 


slow waves, the current modulations being equal in magni 
and opposite in phase, thus cancelling, while the velocity mod 
tions are equal and in phase, thus adding. If V; < Vo is 
r.f. voltage across the resonator gap the initial electron velc 
becomes 


py. Be yy om 
v= (1 = 2Vo* 
The velocity-modulated beam then enters a field-free drift s; 


ng which the fast and slow waves alter in relative phase, 
using!*? a current modulation of 
f= + FLO | jarzioo tot sin ee, 
Wp Vo 
te beam bunches and debunches with distance as shown in 
.5. The first (and largest) maximum of current is seen from 
in. (22) to occur at a distance of 4,/4. 


(22) 
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been developed.*> A four-cavity tube operating at 28 kV gave!?! 
an output of 25kW at 2-9Gc/s. With synchronous tuning of 
the cavities the gain was 60dB; by suitable stagger-tuning the 
gain was 29dB over a bandwidth of 51Mc/s. At a frequency 
of 1 Ge/s, a three-cavity klystron gave*8® 5kW with an efficiency 
of 30-40% and a gain of 20-30 dB. 

Very high pulse powers have been achieved with klystron 
amplifiers, the operating voltages being such that relativistic 
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Fig. 5.—Electron bunching due to velocity modulation. 


(a) is contour for highest efficiency. 
(6) is contour for zero fundamental. 


If the output cavity is placed at this position, its field excites 
nother pair of space-charge waves, the phases of the components 
sing such that the two fast waves cancel and the two slow waves 
id. Thus, the beam leaving the second cavity possesses nega- 
ve energy due, of course, to the positive energy abstracted as 
eful output. The r.f. beam current was shown by Webster?°4 
) consist of an infinite number of sinusoidal alternating com- 
ynents which are all harmonically related to the frequency 
)plied to the buncher. The fundamental component has an 
mplitude given by 


Baroy (BM ays 
I; = 215; Es sin Y 


(23) 


he term inside the brackets is the bunching parameter and for 
w-density beams has a value of 1-84 at the first current maxi- 
um. With optimum bunching the maximum efficiency is 
4%. With a bunching parameter of 3-83 the fundamental 
itput is zero. These results are modified in the presence of 
ace charge!!: 199, 357 and a focusing magnetic field.>” 
Under large-signal conditions the r.f. output gap voltage is 
arly equal to Vo and it is usual, for design purposes, to compute 
€ motion numerically for a number of electrons with different 
trance phases. An intermediate cavity may be placed along 
e drift space to give a cascade amplifier. The current induced 
this cavity by the bunched beam produces an r.f. field which, 
turn, produces additional velocity modulation. The increased 
agth necessitates improved focusing of the beam but the gain 
increased and the theoretical efficiency is 80%. Klystrons 
ith three or more cavities can, by suitable stagger-tuning of 
ividual resonators, be given a broad frequency response. 
ne attempt to attain high efficiency involved!** the use of 
upled harmonic resonators to give non-sinusoidal bunching. 
Advances in the design of klystron amplifiers have recently 
en reported.?0® 320, 330, 338 Practical two-cavity tubes have 
en described,?2° including an example?? for a frequency of 
'Ge/s. Multi-cavity klystrons have given®>>?°5 an output of 
W at 10Gc/s and a five-cavity tube for this frequency has 
} Vor. 107, Part C. 


effects!° must be considered in the detail design. A three-cavity 
tube, with a beam input of 370kV, 190 amp gave*® an output of 
30 MW with an efficiency of 43%: the frequency was 2-857 Gc/s 
with a tuning range of 100 Mc/s, while the pulse length was 
1 microsec and the heater power 800 watts. Similar tubes for the 
10Gc/s and 1 Ge/s bands have given peak outputs of 1-3 MW, 
average powers of 1-8kW and gains of 30-40dB: mechanical 
tuning over a range of 15° was possible. 


(2.3) Reflex Klystrons 


Amplifier klystrons can be made to oscillate by feeding back a 
fraction of the output power to the input circuit and adjusting 
the electron transit-angle to be 


6, = (4n — 1)n/2 (24) 


where nv is an integer. More usually, the input and output 
interaction-gaps are combined in a single cavity to give the 
reflex klystron shown in Fig. 6(a). The electron beam, after 
the first transit, passes into a retarding region where a suitably 
shaped repeller or reflector??+ causes it to reverse and pass 
through the cavity a second time. Maximum power is trans- 
ferred from the returning bunched beam when the drift angle, 
adjusted by means of the reflector voltage, is such that the rf. 
electric field at the gap has its maximum retarding value. Thus, 
as may be seen from Fig. 6(5), the optimum drift-angle is 
2n(n + 3), giving rise to various modes of oscillation.!3» 24 

The power transferred from the electron beam to the resonant 
cavity is given! !! by 


Tee = BeToV sin 0,5 (BV;84/2Vo) 


for low space-charge density. For fixed values of 0, and Vo 
the power output will have a maximum given by 


(25) 


Pax = 0°398IgVo/(n + 4) (26) 
Since the beam input is J)Vo the electronic efficiency?’ 3! 
becomes 
Ne = 0:398/(n + 4) (27) 
2 
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Fig. 6.—Operation of reflex klystron. 


(a) Typical cross-section of tube. 
(6) Optimum drift times. 
(c) Variation of frequency and power with reflector voltage. 


Eqn. (26) shows that the highest power output is obtained 
with a small value of n, i.e. when the reflector is at its greatest 
negative potential. This is illustrated in Fig. 6(c). The effect 
of space charge®® on the performance of reflex klystrons is com- 
plicated by the presence of the returning beam. Information 
on the general operating conditions of these tubes has been 
given by a number of workers.4! 44 114 137,377 Variation of 
the reflector voltage from its optimum value causes the beam 
admittance to become complex and hence changes*® the fre- 
quency of oscillation. This so-called electronic tuning is shown 
in Fig. 6(c) and is seen to be accompanied by variations in 
power output. 

A number of practical refiex klystrons have been de- 
veloped!#0, 215, 258, 261, 288 and measurements on their properties 
have included bunching efficiency,!® !7 hysteresis effects?33 and 
pulse operation.’7 332. For frequencies of 2-4 Gc/s the resonant 
cavity is usually external? to the tube proper. In the 2K28 
and CV35 tubes*>’ the inner portion of the resonator is brought 
out of the glass envelope by disc seals. This technique permits 
a wide operating frequency and such plug-in tubes have been 
developed?>? for frequencies up to 10 Gc/s. 

The manufacture of reflex klystrons for millimetre wave- 
lengths!%° 197,218 presents difficulties because of the small size 
of the component parts. The resonant cavity is usually placed 
inside the vacuum envelope, the output being taken via a sealed 
window to the external waveguide. Two tubes, which together 
covered the range 16-43Gc/s with an output of 5-10mW, 
employed!54. coaxial-line resonators. In a reflex klystron for 
35 Gc/s the construction included?”* a fundamental-mode cavity. 
With an input of 2kV, 10mA, the output power was 200mW. 
Low-voltage operation can be achieved by placing grids at 
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the resonator gap and examples for 50-60Gc/s have 
described.?8? Very high frequencies can be obtained?° by 
a resonator tuned to a high harmonic of the bunching freque 

In most reflex klystrons, mechanical tuning is accompli) 
by fiexing the diaphragm forming one of the walls of the ca’ 
In, for example, the 2K25 this motion is via a screw-oper 
strut, but in another tube, the 2K45, it is achieved by ther 
means? in which the strut, forming the anode of an auxi 
triode, is varied in temperature by controlling the current arri 
on it. By means of a bimetallic combination the strut defe 
the resonant cavity to change tle frequency of oscillat 
Klystrons provided with external cavities can be tuned 
plungers or, alternatively, by utilizing the variable permeat 
of magnetized ferrites.42 For example, in a CV2164 tube, 
resonant cavity was extended!!® to accommodate a lengt! 
waveguide containing a ferrite phase changer. The refle 
voltage was maintained_at the optimum value for power ou 
and tuning was~achieved over 9:2-9-:8Gc/s with a pc 
ranging between S-25mW. 

A modification of the reflex klystron is the multiple-reflec 
tube??! in which several electron transits are used to 
increased efficiency. It is, of course, important that the suc 
sive bunches have the correct phase, and an efficiency of : 
is usual for frequencies up to 10Gc/s. A typical multi-r 
klystron for 3-5Gc/s would give 10 watts output, the rang 
mechanical tuning being 300Mc/s. Another tube, whic 
suitable for millimetre wavelengths, consists!>»>° basically 
single-cavity two-gap klystron with a floating drift tube. 
theory of operation is similar to that of the reflex klystron 
by applying a separate voltage to the drift tube, thus varying 
transit angle, it is possible to produce frequency tuning. In 
monotron, the drift tube is omitted and bunching takes f 
during the long transit across the cavity. 

In the retarding-field oscillator? !43 the bunching takes p 
in a single region containing both an r.f. and a d.c. field. 
reflector, as shown in Fig. 7(a), forms part of the reso} 
cavity: such a design permits simple construction and 1 
mechanical-tuning range. When the electrons enter this ¢ 
bined field the operations of velocity modulation, bunching 
delivery of r.f. power to the circuit are all carried out in 
same interaction space. Ina tube>!>>? for 30 Gc/s, the reson 
had a diameter of 0:14in and a height of 0:05in. The coz 
was 0:102in long with inside and outside diameter 
0-065 in and 0-072 in respectively. A pointed probe, at refle 
potential, was used in the reflector to aid in the focusing ot 
beam returning to the anode. The electron gun gave a b 
diameter of 0-Olin with a perveance of 3-5 x 10~°amp 
volt3/?._ The output performance of this tube is show 
Fig. 7(6), from which it will be seen that the power outp 
above 40 mW over the frequency range 27-33 Ge/s. Other t 
have given powers of 500mW, while 300mW was obta 
at 50 Ge/s. 


(3) BEAM AND CIRCUIT INTERACTION 
(3.1) Growing-Wave Tubes 


Modification of, or interaction between, space-charge w 
in electron beams can result in their increasing in amplit 
In the resistive-wall amplifier23 the electron beam is surrou! 
by a resistive film which may be in the form of either strir 
hollow tubes. In the example of Fig. 8, both the input 
output couplings are via resonant cavities. The micro\ 
signal in the input cavity produces slow and fast waves in 
electron beam. As each wave travels, it induces currents it 
resistive wall surrounding it, with consequent dissipatio: 
energy and decrease of power. Since the fast wave has pos 
power this means that its amplitude decreases to a very s 


10-°in thick, supported on a thin glass tube. For a 
current of 10mA the resistive-wall tube had a gain of 
B compared with the 4dB of the same tube fitted with a 
Zin metallic drift tube. It has been found that the capacitive- 
ng effect of the glass support tends to reduce the gain. 
his can be partly overcome by etching the glass to a thickness 

*005 in or by mounting”? the resistive coating on an inductive 
the ‘easitron’ tube!®? 272,27 the electron beam is in close 
t with a periodic structure. In the inset of Fig. 8, the 
es of resonators is adjusted so that, at the operating frequency, 
ssless negative susceptance is presented to the beam. The 
ve plasma frequency becomes imaginary and the phase- 
e coefficients of the fast and slow waves are, on substitution 
in eqn. (14), found to be equal and given by 


B. rE ‘2 =(w + jw,)/V (28) 
le respective characteristic impedances are 

Le = ae aed = + JZ, (29) 

Z.. = —AimyVo _ a (30) 


wlo 
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Fig. 8.—Growing-wave tubes. 
Example shown is a resistive-wall amplifier with (inset) alternative ‘easitron’ structure, 
The slow wave, however, has negative power and thus The power flow along the beam, given by 
mplitude increases. ibe we 
1 one tube the resistive layer was a coating of tin oxide P= — jltplisZe + (—HiphsZ.)* - G31) 


takes place only when both waves are present. The two waves 
vary in amplitude with distance, the one increasing and the other 
decreasing, so that If-J,, is constant. Hence, by suitable input 
and output coupling, the increasing wave can give amplification 
of microwave signals. 

Methods of amplification have been proposed!>!: 262 in which 
the characteristic impedance is varied periodically along the 
beam. This results8+ in a stop band when the lengths are such 
that 


79 r A 

I = me aes 
m 204 4 G2) 

Wis T71V9 at roB 
In = 2a 4 a 


the subscripts A and B denoting the parameters of the alternate 
sections. These stop bands are characterized2”° by growing and 
decaying waves. Such space-charge wave amplification can also 
be analysed!® !©9 by considering the effect of the sudden steps 
on the velocity or current modulation of the beam. Eqns. (19) 
and (20) show that the beam impedance can be changed by 
variation of the accelerating voltage or of the plasma frequency 
by change of current density or of the factor s. 
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Fig. 9.—Space-charge-wave amplifier. 


The regions of different potentials result in growing and decaying wayes. 


Amplification by change of beam potential may be achieved by 
the velocity-jump tube?!:1°2 shown in Fig. 9. The electron 
beam is surrounded by a series of metallic cylinders of the 
correct length and at alternate steady potentials. At an upward 
jump, the fluctuating electron velocity is increased by the square 
root of the voltage ratio. The downward jump occurs at a 
plane of zero velocity modulation and hence has no effect. 
Amplification is thus achieved as the beam progresses along the 
structure. One tube, with helices for input and output coupling, 
gave a gain of 20dB at 2-89Gc/s with beam potentials of 66 
and 680 volts. 

In the stepped or rippled-wall amplifier,2” the beam impedance 
is varied by alteration of the radius of the drift tube. If the 
tube is expanded at the plane of a current maximum, the square 
root of the current is increased by a factor (s4/s,)*. Six quarter- 
wave sections were employed in a tube giving?* gains in the 
range 19-25 dB at frequencies of 2-3-1-8 Gc/s. 

The radius of the electron beam is varied in the scalloped-beam 
tube.2?3, 224 Small-signal theory2°! shows that it is possible to 
increase the r.f. current density in such a beam by arranging to 
have a low d.c. density in the region where bunching occurs and 
a high density where debunching takes place. Experimental 
tubes have amplified at frequencies of 1-4Gc/s, the scalloped 
shape being produced by suddenly introducing the electron beam 
into a longitudinal magnetic field at other than the correct 
equilibrium radius. 

Haeff !28: 129 showed that amplification can be obtained if the 
electrons of two beams, travelling with different velocities, are 
intermingled. It is necessary? !©° that the velocity distribution 
of the combined beam should possess distinct peaks: the condi- 
tion for amplification is that the slow wave on the faster beam 
should have nearly the same phase velocity as the fast wave on 
the slower beam. : 

Double-stream tubes may be analysed!4® 242, 269, 278 by letting 
the beams have average velocities v9 4, Vgz and plasma frequencies 


Wy4, Wp. The propagation coefficient is given!3? by 
aye oe 
val (jw Sie (jw ae fe 
The solution of eqn. (34) is of the form 
Y = & + jolVom (35) 
where U,, is the arithmetic mean velocity of the two beams. Let 
Vod = Yom + 6 and og = Yom — 8 (36) 


OUTPUT “CAVITY 
ELECTRON BEAM 


so that, if 6 < VA OF Yop, WpA = WpR = Wp: Eqn. (35) 0 


becomes 
2 2 1/2) 1/2 | 

Som = + 4 ( a +12 [4( oe a] \ | 

Wp Wom Wom My 


which is seen to represent four possible waves. A plc\k 
eqn. (37) shows the dependence of the propagation coeffic 
of the four waves on the average beam velocity, 
density, velocity separation and frequency. If the pararj 
w6/(wW,¥om) > 1/2, all four values of « are imaginary, | 
representing unattenuated waves, two of which travel faster jf 
Vom two slower. For larger beam currents, when w6/(w)\, 
< 4/2, two of the possible values of « are real, one neg/i 
and the other positive. When « is negative, the wave 
exponentially with distance along the beam. Further detail 
double-stream amplification have been given?!4: 252, 254, 263); 
experimental observations reported.? | 


ie) 


(3.2) Travelling-Wave Tubes 


Travelling-wave tubes are assumed here to result from}! 
interaction of an electron beam with the forward-trave) 
fundamental space-harmonic of a periodic or slow-wave sj 
ture. Such tubes were first described by Kompfner!85-!88} 
Pierce,7®°: 271,272 since when their detailed operation has i 
extensively analysed.®!; 200, 201, 202, 316,335, 340 A typical exar 
shown in Fig. 10, employs a helix as the slow-wave struci/ 
the phase velocity is about 0-1c, corresponding to a beam pai 
tial of 2-5kV. Such electron tubes, in which the only magi} 
field present is that required for focusing the beam, are us} 
designated?>° O-type. 

Combination of the propagation relations for the elec 
beam and the slow-wave circuit gives, for the case of s 
space charge, the basic travelling-wave-tube equation!?3: 30 


(y? + Bay — JB)? = — 2B.(CB.)? 


cues wale 
2B2P 4Vo 
is the gain parameter.?”*_ The quantity E2/(262P), relating) 
axial electric field to the power transmitted along the cir} 
has the dimensions of an impedance. 
Eqn. (38) is of the fourth power in y and therefore leac 
four possible propagation coefficients. In terms of 
theory,® 299 300, 301 the solutions correspond to fast and > 
space-charge waves and forward- and backward-travelling ci) 


where 
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| Fig. 10.—Helix-type travelling-wave tube. 


Propagation along the helix is from left to right. 


|waves. In operation of the tube, approximate synchronism 
- obtains between the first three waves, while coupling to the 
fourth is negligible. On interaction between the forward circuit 
| wave, which has positive power flow, the slow space-charge 
|waye, which has negative power flow, splits into a conjugate 
| pair as in the double-stream tube. Solution of eqn. (38) shows 
| that one wave decreases with distance, the amplitude varying as 
| @-0-8668-Cz, The other has an amplitude variation of ¢9°-8668:Cz 
corresponding to a gain of 47-3C decibels per wavelength. This 
‘growing wave forms the output available at the end of the helix. 
An equal negative power existed in the slow wave and thus the 
electron beam is finally left with less power than it possessed on 
‘entering the helix. The useful output of the tube thus comes 
from the kinetic energy of the electrons. 

__ Interaction with the circuit wave results in the fast wave, which 
has positive power flow, varying in phase only, according to the 

quantity e/@—©?:, However, with certain values of beam 
voltage and current, the circuit wave couples to, and is trans- 
“ferred!'9 to, the space-charge wave. Under this Kompfner!®? 
‘dip condition, the travelling-wave tube produces no output. 
_The power flow along the tube is constant, the energy being 
carried by the circuit near the input end and by the electron 
beam near the collector end. 

‘More detailed calculations of gain have been made 
and, in particular, the effect of space charge has been 
examined!4; 109, 247, 336 by the inclusion of Q, a dimensionless 

e-charge parameter2’2 which relates the capacitive impedance 
ween the beam and circuit to the resistive impedance of the 
These results enable the gain to be calculated as a 
‘function of beam current and geometry. Studies have also been 
‘made on the effects of velocity distribution35® and other 
factors’ 195, 199 on gain. 
‘Analysis of the large-signal behaviour of travelling-wave tubes 
has been made with a view to estimating performance at large 
‘power output and efficiency.®® 245,362 Partial large-signal 

ories have been presented,*9:245 assuming small gain and 
negligible space charge and attenuation. More refined theories 
have included effects due to space charge for both small3?8 and 
finite*? yalues of C and to other parameters.!®:235 Results 


) 


of computations have been plotted?9* graphically to show the 
‘Tf. voltage amplitude along the tube length for various values 
of OC, relative injection velocity and input-signal level. A study 
these curves suggests means of improving the maximum 
turation efficiency, such as by using a phase-changing device 
r lapering the helix pitch in order to reduce the wave velocity. 
In the travelling-wave amplifier, there is a direct electrical 
nection from the output to the input terminals. Mismatches 
set up a backward circuit wave, resulting in multiple reflec- 
S which cause fluctuations in gain with frequency® or, in 


73, 193 


extreme cases, instability and self-oscillation. Such effects can 
be reduced by introducing attenuation!’ into the circuit: it can 
be shown that for lumped attenuation® !8? only about one-third 
the inserted value is effective in reducing the forward gain. 
Experimental investigations have shown’* how power capacity 
is limited by attenuator configuration, low resistivities in some 
attenuators loading’? the circuit to reduce gain and power 
output. The effect of attenuation on tubes of high output has 
been studied by probe techniques.°° 

Non-reciprocal attenuation, such as that based on electronic 
methods2> or magnetized ferrites,°’ has the advantage that the 
forward gain is not impaired. A large effective attenuation can 
be obtained by the use of two separate helices??? which are 
coupled by the velocity modulation and r.f. beam current which 
persist in the intervening gap. In one example!” !78 the output 
helix was operated at a slightly different voltage, a gain of 
55-60 dB being obtained at 4 Gc/s. 

The extensive number of measurements on travelling-wave 
tubes include general performance,?48 small-signal charac- 
teristics,3°® 397 beam cross-section?°8 and the effect of reflec- 
tions.!7!_ For investigating some aspects of performance, graphi- 
cal methods have been employed.2°3 Large-signal properties 
have been investigated>® 74 175. 349 including accurate measure- 
ments’ on a tube scaled to a low frequency. 

The slow-wave structures employed in travelling-wave tubes 
have included interdigital lines, sinuous rectangular wave- 
guide!®° and, for frequencies near 10Gc/s, corrugated coaxial 
lines.!°3 In an application requiring performance over only a 
10% frequency band, a tube employing? a dispersive helix gave 
a gain of 20dB with a beam current of 100 uA. More generally, 
the helix is used in the non-dispersive region to give bandwidths 
of one octave or more. This circuit is usually inside the vacuum 
envelope although external helices have been employed.??5 226 
Auxiliary helices have been proposed?44 for coupling to the 
input and output ends of the circuit helix and to an external 
attenuator. If the tube is provided with internal’? 1°? or 
external feedback, a self-oscillator is obtained. A travelling- 
wave tube has been proposed3°? as a distributed buncher for a 
klystron, thus eliminating one tuned circuit: an amplifier of this 
type could give an output of 10kW with 35% efficiency, a gain 
of 18dB and a bandwidth of 40 Mc/s at 900 Mc/s. 

The travelling-wave tube has been mostly employed as an 
amplifier of low- and medium-level signals and numerous 
examples have been described.7% 84 179. 181, 294, 304 The mid- 
band gains are in the region 50-60dB, falling somewhat at the 
ends of an octave band. Tubes have been developed for 
0-7 Ge/s,!48 2Ge/s,®8 4 Ge/s?9® 393 and 7:5Gc/s:3% 3! an all- 
metal tube®® for 3Gc/s gave a power output of 4 watt for a 
beam potential of 750 volts and beam power of 15 watts. Lower- 
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frequency?? tubes have given 20dB gain over the band 70- 
500 Mc/s with efficiencies of 20%. At the other end of the 
frequency scale, an experimental tube?°* for 50Gc/s, with a 
helix of diameter only 0:015in, gave a gain of 19 dB. 

Design information*®’ on more powerful tubes has, in con- 
junction with suitable slow-wave structures, enabled mean 
powers of 1 kW to be achieved.3® Recent work*? has extended 
the application of travelling-wave tubes to high-power pulse 
generation. Good thermal dissipation has been achieved!!> by 
means of an iris-loaded waveguide with slot coupling between 
successive cavities. At the mid-band frequency of 2-85 Gc/s the 
gain was 35dB with an efficiency of 34% and power output of 
3MW. The input was 100kV, 62amp, and the 4in diameter 
beam was focused with a magnetic field of 1 000 oersteds. 

In the transverse?> travelling-wave tube the electrons are only 
permitted to travel through the interaction space a distance 
small compared with its total length. The slow-wave circuit 
may be a flattened helix, operation being achieved by skewing 
either the circuit, as in Fig. 11(a), or the beam,—as in Fig. 11(d). 
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Fig. 11. 


(a) Tube with skew circuit. 
(b) Tube with skew beam. 
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between a phase velocity of v, for the fundamental, when n = 
and the value v,(d + nA,)/d for a space harmonic of orde 
The phase velocity of a harmonic required for synchronism \ 
the electron beam can be much greater than that of a fundame 
wave. Thus, for a given beam velocity and working freque: 
there can be an increased separation of the circuit-elements. 
Space-harmonic travelling-wave amplifiers have employe 
then = + 1harmonic. One such tube contained?”®: 279 a rie 
waveguide loaded with 100 stubs, cut tranversely in the ri 
spaced 0:534mm between centres, 0:178 mm wide and fA d 
To obtain good coupling, the electron beam passed along t 
axial grooves in the ridge. The beam power was 1:2kV, 41 
with a magnetic focusing field of 1600 oersteds. At a freque 
of 50 Gc/s the gain was 20dB and the power output 20 mW. 
In backward-wave tubes interaction occurs with the n = 
ot first reverse space harmonic. The phase condition is the 


Oped = Vo(A, i @) ee 


where v, is now measured in the reverse direction: the cit 


FREQUENCY, Ge/s 
(e) 


‘Transverse-current travelling-wave tubes. 


(c) Theoretical yalues of gain and power for (i) conventional and (ii) transverse-current tubes. 


The properties of the tube are such as to give three forward 
waves which increase in amplitude exponentially, linearly and as 
the square of the distance respectively. When the distance is 
below some critical value, the gain is less than that of the 
equivalent parallel tube and varies more rapidly with current 
and frequency. A useful feature of this device is that the power 
output reaches a saturation value as the input signal is increased, 
which is dependent upon the performance of the individual 
beam elements. Experimental transverse-current tubes have 
shown®*: 1°! properties agreeing well with ‘the theoretical values 
given in Fig. 11(c). 


(3.3) Backward-Wave Tubes 


An electron beam can move in synchronism with, and interact 
with, one of the higher space harmonics of a periodic structure. 
In the case of a line loaded with stubs, as shown in Fig. 12, 
the condition for the electric field at the mouth of each of the 
stubs to be seen in the same phase by an electron is given by 


v,d = vod + nd,) (40) 


2, etc., and corresponds to the order of the 
The electron has no way of distinguishing 


where n = 0, +1, + 
space harmonic. 
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Drawn for n = — 1 with vy, in the reverse direction. 
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Fig. 13.—Types of backward-wave oscillator. 


(a) Unbalanced bifilar helical structure with a hollow electron beam. 
(6) Ridged waveguide with slots in the upper wall. 
7 (c) Ridged waveguide with slots in the ridge. 


wave and the electron beam travel in opposite directions. This 
teverse harmonic is well known to be dispersive—the phase 
yelocity varies rapidly with frequency. Since the condition of 
teraction is approximate synchronism between the circuit and 
sam velocities, it is evident that the frequency of operation of 
kward-wave tube can be varied over a wide range by change 
beam voltage. Such tubes!2>»19° are often known as 
inotrons. 
he principles?°® of beam-and-circuit interaction follow 
ely those relating to the travelling-wave tube. Analysis?» !42 
lows that the power/gain ratio can be expressed as A/(Iy — I,)’, 
here A is a constant near unity and J, is the starting current 
oscillation. The gain thus increases rapidly when J) exceeds 
I, and is confined to a narrow band of frequencies. More 
liable operation is achieved in the cascade amplifier,”!» 3% in 
h a first slow-wave structure, such as a helix, modulates the 
ctron stream by interaction with the backward wave and a 
ond helix is, in turn, excited by the modulation on the stream. 
advantage over a single-structure tube is increased stability 
‘cold’ isolation between input and output circuits. The 
dwidth, which is normally a fraction of 1%, may be increased 
newhat, with a sacrifice in gain, by operating the two structures 
slightly different voltages. A third structure, matched at 
h ends and also synchronized with the backward wave, is 
nd to increase the gain and the gain-bandwidth product over 
of the two-structure tube. 
ackward-wave tubes have a natural tendency to act as self- 
illators and are usually employed as such. More detailed 
ulations of the minimum starting current have been 
de!99, 347 and extended3® to show that the complex nature 
the propagation coefficients for backward-wave interaction 
be explained!!° on the coupled-mode theory. The efficiency 
these tubes has been investigated!23 on the assumption that 
t.f. current amplitude is equal to Jj: it may be shown that, 
h a helical structure, it is advantageous to use a thin annular 


A number of practical oscillators have been de- 
veloped.®: 96. 237, 248, 321, 342 The internal arrangement is usually 
th that the slow-wave structure is joined fo the output guide 
that end at which the electron beam is injected and, at the 
er end, to a non-reflecting termination. A reflected wave 
a mismatched load represents a reduction in useful power 
utput, but, since the condition for synchronism is not satisfied, 
not interact with the electron beam and thus does not 
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affect the frequency of operation. In the backward-wave 
oscillator of Sullivan,?23 shown in Fig. 13(a), the slow-wave 
structure is essentially a coaxial line wound into a helix, part 
of the outer conductor being cut away to allow interaction with 
the hollow electron beam. The outer helix is made by machining 
a groove in the inner cylinder. The inner wire is insulated from 
this groove by three dielectric strips and is brought back along 
the axis to the coaxial output. With a magnetic focusing field 
of 1 000 oersteds the tube operated from a frequency of 13-3 Ge/s, 
at a beam voltage of 3kV, to 2-6Gc/s at 40 volts, the output 
being in the range 50-5 mW. 

Other tubes?3?: 373 have oscillated over the range 8-15 Ge/s. 
An example!>® for 8-18Gc/s employed a tungsten-tape helix 
supported by three quartz rods as the slow-wave circuit in 
order to achieve adequately high impedance, low manufacturing 
cost, good match to the waveguide output and easy control of 
microphonic effects. The tape thickness was the minimum con- 
sistent with reasonable mechanical strength and the tape width 
was approximately half the helix pitch for maximum impedance 
of the reverse space harmonic. The helix diameter was chosen 
to be as large as possible consistent with avoiding anomalous 
tuning effects.2!7_ The helix pitch was then chosen to make the 
high end of the band correspond to a beam voltage of about 
2kV. A hollow electron beam was employed and the main 
practical data are given in Table 1. 


Table 1 


BACKWARD-WAVE OSCILLATOR DATA 


Frequency range, Gc/s 7°6-12°4 11-6-19-0 
Tape width, in 0-019 0:013 - 
Tape thickness, in .. 0-003 0-002 
Helix inside diameter, in 0-134 0-092 
Helix pitch, turns/in oe 24-0 B15 S) 
Beam outside diameter, in. . 0-122 0-081 
Beam thickness, in .. 0-010 0-004 
Beam current,mA .. 10:0 6:6 
Active helix length, in 2°15 1-60 


Backward-wave oscillators have been constructed!®* with a 
bifilar helix, including an example®? for the frequency range 
180-780 Mc/s. Tubes developed?4? for the ranges 1-2 Ge/s, 
2-4Gc/s and 6-11Gc/s employed interdigital lines, measure- 
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ments of their performance indicating that the basic ideas of 
backward-wave interaction may be correctly applied to their 
design and construction. An array of hairpin-like elements has 
been successfully employed! as a slow-wave structure. 

These oscillators have been found particularly successful in 
the millimetre-wavelength region. The main problems in such 
tubes are the mechanical realization of delay structures of very 
small size which allow the thermal dissipation of relatively high 
powers. A backward-wave oscillator giving 1-10mW in the 
frequency range 70-50Gc/s employed®! a helix of tape, size 
0:002in by 0:005in, wound with a 0-025in pitch. Similar 
tubes have covered the ranges 17-41 Gc/s”44 and 20-40 Ge/s. !?? 
Karp!® 163 employed the structure shown in Fig. 13(b), which 
consists of a ridged waveguide loaded with a series of transverse 
slots cut in the broad wall opposite the ridge. This slotted wall 
may more easily be prepared by winding a grid of gold-plated 
molybdenum tape around the waveguide, or alternatively by 
photo-etching a thin metal foil. The electron beam traverses the 
central portions of the slots, where the axial electric field is a 
maximum, while the slots are, with the ridge, gradually tapered 
away at the output end to present a match to the waveguide. 
Oscillations were obtained at 57-61 Gc/s by adjustment of beam 
potential to 900-1170 volts. Other similar tubes have been 
made to operate at frequencies up to 200Gc/s. In the arrange- 
ment?43 shown in Fig. 13(c) the electron beam travels through 
holes in a series of vanes: several such tubes have covered the 
frequency range 15-100Gc/s. A similar vane structure was 
employed! in a tube which oscillated at 40-51 Gc/s as the beam 
potential was varied in the range 1-6-4-3kV. 


Part 2. CROSSED-FIELD TUBES, ULTRAMICROWAVE 
GENERATION AND ELECTRON SOURCES 


(4) INTERACTION IN CROSSED FIELDS 


(4.1) Amplification 

Tubes in which there is a field normal to the electron motion 
are capable of realizing high efficiency, since it is the potential 
energy of the electrons that is converted into r.f. energy: the 
kinetic energy is unchanged and thus interaction can be main- 
tained with a slow-wave circuit over a long period. In the 
E-type tube*77: 3 an electron beam flows between the walls of 
a parallel-plane transmission line which is bent in the form of a 
ring. A transverse electrostatic field applied between the two 
conductors causes the beam to follow a circular trajectory in 
which the centrifugal and electrostatic forces are equal and 
opposite. The inner conducting surface supports a slow wave 
with which the beam interacts to give amplification. 

To obtain satisfactory phase focusing of E-type tubes, it is 
necessary that the number of electronic wavelengths per revolu- 
tion should be of the order 4/2. A longer interaction period 
between the beam and the circuit wave tan be achieved by 
giving the beam an initial axial drift velocity so that it spirals 
along the structure. This principle is employed in the spira- 
tron>”> in which, however, the r.f. interaction involves only the 
axial component of the circuit electric field and not the angular 
component. The helitron®!? can be visualized by considering 
a beam of electrons travelling in a helical path encircling the 
circuit, which may be a four-conductor transmission line. The 
r.f. interaction is between the electrons and the angular and 
radial components of the circuit electric field, and thus the tube 
is an E-type; no magnetic field is required. One example 
oscillated between 1-2-2-:4Gc/s with a tuning potential of 
650-1700 volts. The starting current was 0-4mA for a structure 
4in long. 


Crossed electric and magnetic fields are more usually employ: 
and this so-called M-type interaction enables the equivalent 
the O-type tubes to be realized. If the beam is enclosed 
walls having infinite admittance the only fields within the be: 
are those due to space charge. Growing waves are always 
up425; 467, 611, 618 since only a monotonic variation of electt 
velocity is required with crossed fields. Under certain con 
tions, growing waves may exist*3? if the wall admittance | 
resistive or reactive components. 

Travelling-wave interaction*®!: 429.439 occurs in the pla 
structure??!: 610 of Fig. 14(a), the electron beam flowing betwe 
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Fig. 14.—Crossed electric- and magnetic-field tubes. 


(a) Planar version of the T.P.O.M. tube. 
(6) Circular version of the ‘M’-carcinotron, 


a negative electrode, termed the sole, and a slow-wave or dé 
structure. There is a constant magnetic field perpendicular 
the direction of constant electric field and to the motion of 
electrons. If the delay line and sole are a distance d apart, V 
the voltage between them and 


Vo = V,|Bd . . . . . . ( 


then the path of the electron beam is parallel to the electrox 

When a wave is propagated along the delay line under c 
ditions of synchronism, the electrons interact to give ph 
focusing.°?? This process provides a mechanism by wth 
electrons that have absorbed energy have their paths cur 
towards the sole so that they are removed from the interact 
space. Electrons which give energy to the circuit wave 
curved towards the delay line, while being maintained in 
correct phase of the r.f. field, and form into discrete bunches 

Under small-signal conditions and with negligible space cha 
the increase, d/,, of the alternating current between plane 
and z+ dz along the beam must be proportional to the 
electric field and to the length dz, so that 


dl, = KEdz > eye a 
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The variation, dP, in the power flowing along the line is that 
due to the power gained by the beam. The latter is proportional 
to E_I,dz, and since P is proportional to E2 and dP to E,dE,, 


dE, = K’l,dz . (44) 
Elimination of J; or E, from eqns. (43) and (44) gives 
aE, aI, 
Be = VE, and ae — y71, (45) 


where y? =.KK’. More complete calculation shows that at 
synchronism ‘y? is real and is given by 


oes -° coth wd/v, (46) 
Where 6 is the distance between the beam and the sole. The 
value of Z, is given by 
Eee 


The field distribution along the line, putting 7; = E, = 0 when 
z = 0, is given by 


I, & sinh yz and E, © cosh yz (48) 


and y becomes the gain coefficient of the tube. These results 
are modified under large-signal conditions.*4* 

The efficiency may be estimated by assuming that the electrons 
jave initially a rectilinear flow along an equipotential, Vo. 
m the presence of the r.f. field, an electron moves towards 
he anode along trajectories orthogonal to the lines of force. 
[he electron receives from the constant electric field an energy 
(V; — Vo). Since its kinetic energy remains unchanged and 
qual to eVo, the energy e(V; — Vo) is transferred to the r-f. 
ield. Under these conditions 


(49) 


nm fact, not all the electrons reach the anode, and the above 
ficiency should be multiplied®!* by a factor of 0:8. The 
ficiency is also reduced by beam interaction®?’ with interfering 
pace harmonics and by space-charge wave amplification.®!? 
Another source of reduced efficiency is cycloidal motion of the 
lectrons, which gives them rotational energy not normally 
vailable for conversion into r.f. energy. The electrons in their 
. motion reach the equipotential, 4Vo, and the energy 
vailable for transfer is only e(V; —4V o). The electronic 
ficiency thus becomes 

t Vo 
rpaie0 (1 4) 


here A lies between 1 and 4, depending on the electron 
fajectories. 

The plane structure considered is not, of course, necessary, 
nd a tube in which the delay line is bent to circular shape is 
asier to manufacture and requires a more compact magnet. 
N one typical tube,°!3 an inter-digital delay line was used in 
der to achieve low dispersion and high coupling impedance. 
\t frequencies of 1-2-1-5Gc/s the gain was 10-20dB, the 
ficiency was 35-40% and the power output was 200-450 watts. 
he crossed-field principle can also be applied to high-power 
ulsed amplifiers, and, for example, a peak power of 1 MW, 
OdB gain, 40°% efficiency should be possible at a frequency of 
Ge/s with a bandwidth of 12%. 


: (4.2) M-Type Carcinotrons 


A backward-wave tube operating with a crossed magnetic 
eld results in the M-type carcinotron.39!: 435,538 The power 
; 


(50) 


flows in a backward direction along the delay line: dP is the 
negative of the power given by the beam between z and z + dz, 
and thus a negative sign must be inserted in eqns. (44) and (45). 
The boundary conditions are J; = 0 at z=0 and E, = 0 at 
z =1,, the circuit length. The gain conditions become 


(51) 


The starting current for oscillations is obtained, since cos y/, = 0, 
from the relation 


Ic sin yz and E, c cos yz. 


8 
ela ie 
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ep Up 

Space-charge conditions give rise to the diocotron effect,®!? which 

reduces the starting current. The electronic efficiency is approxi- 
mately that given in eqn. (50). 

These backward-wave tubes may exist in planar or, as shown 
by the example of Fig. 14(4), circular form. The frequency may 
be varied by means of the magnetic field or, as is more usual, 
by the line voltage. The tuning band is limited either by the 
cut-off frequency of the delay structure, or by practical considera- 
tions such as maximum voltage or maximum direct-power input. 
If the variation of power output is not a consideration, very wide 
tuning bands may be achieved, the variation of frequency with 
voltage being smooth and almost linear over the whole range. 
This characteristic can be used at a very high speed, and, there~ 
fore, for frequency modulation. 

The M-type carcinotron has proved useful for high-power 
high-efficiency operation.®!3 In one example, the power output 
was about 400 watts with an efficiency of 40%, the frequency: 
being variable over the range 1:75-2:85 Gc/s with an applied: 
voltage of 2:6-5:8kV. As with other backward-wave structures, 
the frequency of oscillation is insensitive to load impedance and. 
the lines of constant output on a Rieke diagram would be nearly 
circular. Inacoaxial design®!’ for pulse operation, the magnetic: 
field is produced by a large current passing along the inner con-- 
ductor. Other M-type wide-band tubes include the carmatron; 
oscillator®!® and the platinotron amplifier.*°” 


(52) 


(5) MAGNETRONS 
(5.1) Regimes of Oscillation 


One large class of microwave tubes, depending for operation 
on crossed electric and magnetic fields, is the magnetron. The 
basic structure of periodic circuit and electron beam is bent 
around in a circle so that the path of amplification is a closed 
loop: the tube supplies its own input and thus becomes an 
oscillator. The cylindrical sole takes the form of a thermionic 
cathode, so that electrons are emitted along the tangential length 
of the structure. From Fig. 15(a), the equations of motion of 
an electron are 


d*r . a = erB d@ e dV (53 
dt? dt m at m dr ) 
1 d,/ ,d70 d0 dr dO eBadr 
ph = pe 
r dt\” ae) To Gein |S 
Rearrangement and integration of eqn. (54) gives 
dd eB 3 
Gaon 3) re 


When the electron is at its most distant point from the cathode, 
the radial velocity is zero and thus 
ey 
dt 


V 
eet lee. iitdcans Seen 
mr 
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Fig. 15.—Anode voltage and magnetic field in a magnetron. 


(a) Geometry of magnetron. 
(6) Cut-off parabola and Hartree lines. 


If the electron just grazes the anode, the relation 
(57) 


gives the cut-off parabola of Hull**! shown in Fig. 15(6). Space- 
charge wave amplification and other effects are thought to 
prevent the cut off from being abrupt and complete in 
practice.475> 604 

The anode current of a diode is given by the V3/? relation of 
Langmuir®°? up to the onset of saturation, but it is modified in 
the presence of a magnetic field to become 


aera (3 ay 1) B 
me ue ; 


sites (58) 
where A is a slowly varying function of 7,/r,. Typical curves 
are shown in Fig. 16. If the cathode radius is small and space- 
charge effects are negligible, the electron moves in an approxi- 
mately circular orbit with angular velocity eB/m so that 


AB = 2nmefe . 


(59) 
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Fig. 16.—Voltage/current characteristics of a magnetron. 


The typical oscillation condition for 1200 gauss cuts the vertical axis at the Hartree 
voltage. 


R.F. oscillations corresponding to eqn. (59) are observed*73: 476 
on connecting a resonant circuit between anode and cathode, w 
negative-resistance low-frequency oscillations are obtained*’* 
when the anode is divided into one or more pairs of segm« 
between which a resonant circuit is connected. 


(5.2) Electrical Circuit Structure 


At microwave frequencies the magnetron anode takes the fe 
of a periodic structure, to a space-harmonic of which the elect 
beam couples. Since this circuit is a complete cirele, only we 
of those frequencies which result in an integral number of we 
lengths around the closed path can exist. The structure r 
be interdigital,482, 484, 507,509,510 but more usually takes 
form of a strip transmission line or waveguide series-loaded v 
stubs, as shown in Fig. 17. The axial-field variation is 2 
for a transmission line and half-sinusoidal for a waveguide, w 
the only radial mode of.importance is that giving a quar 
period-variation from the mouth of the stub. 

The circumferential modes are more complicated and fo 
stubs and x cycles of field variation around the anode, n 1 
have the values 0,1... N/2. All modes except N/2 are dot 
degenerate, allowing travelling waves to propagate in ei 
direction. The N/2 mode is a singlet and gives a standing w 
in any case. There are, moreover, higher space harmonics gi 
additional cyclical variations of electric field. In general, 
number of repeats is given*!> by 


m=n-++ PN. © eae 


where p is any integer, positive or negative, or zero. Nega 
values of m indicate components travelling in a direction oppc 
to the fundamental. 

If, for example, N is 8, the phase differences for the fur 
mental modes, p = 0, n = 1, 2, 3 and 4 are 7/4, z/2, 3 
and 7. These relations are illustrated in Fig. 18, where 
anode structure has been opened out.44© For the case of x = 
the space harmonic with p = — 1 is also shown. The ac 
calculation of the resonant frequencies is difficult, and accu 
results have only been given?® 5° for infinitely long anodes 

The separation in frequency between, say, the m-mode 
the unwanted modes may be increased by strapping. 
straps consist*!> 55° of thin wire or strip connecting alter 
segments, and, as shown in Fig. 19, may exist in many for 
The usual symmetrical forms of strapping are single-ring, ech 
and double-ring. Unsymmetrical strapping®°° may be achie 
by inserting breaks or by employing the YB-system. For m« 
other than the 7-mode, current flows in the straps, the inducte 
of which detunes the circuit. The 7-mode frequency is char 
slightly and may, in fact, be preset within limits by adjustn 
of the straps. The improvement due to either single- or dou 
ring strapping of a typical 8-stub anode is shown in Table 2 

Mode separation at frequencies higher than about 10 Ge, 


Table 2 


EFFECT OF STRAPPING AN EIGHT-STUB ANODE 


Resonant frequency 
Mode number 


No strapping Single-ring 


Double-ring 


(a) HOLE AND SLOT 


Fig. 


oe ee ee ee 
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nore easily achieved by systematic modification of the slow-wave 
tructure. In one method,4%? 57° the stubs are alternately long 
nd short. A typical example of an 18-stub ‘rising-sun’ anode 
; shown in Fig. 20(a). The resonant wavelengths of the struc- 
ure are shown in Fig. 20(b) and are seen to fall into two groups. 
\s the length ratio increases, maintaining the 7-mode wave- 
sngth constant, the wavelengths of the modes nm = 1 ton = 4 
ncrease, while those of modes m = 5 to nm = 8 decrease. The 
‘oltage antinode recedes into the long stubs and the net radial 
ield thus caused produces an interfering n = 0 or zero mode. 
: 


~. 


Fig. 18.—Mode patterns in a magnetron. 


The full lines represent electric field and the dotted lines represent magnetic field. 


There is thus an optimum length ratio of about 2. When the 
number of stubs becomes large, improved performance is 
obtained by partly or completely closing the anode structure 
with end plates.48° 

Another anode structure makes use** of the TEp,; mode in a 
coaxial cavity, concentric with an inner resonator assembly. 
The rear walls of alternate resonators are slotted through to 
couple the system. The anode can be tuned in frequency by 
plungers in the coupled guide, the construction being suitable 
for frequencies of 10Gc/s. A magnetron structure has been 
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Fig. 19.—Strapping systems. 


(a) Single ring. (c) Echelon. 
(b) Double ring. (d) YB-type. 
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Fig. 20.—Rising-sun magnetron. 
(a) Structure of 18-stub anode. 


(6) Mode plot, A. being maintained constant by choice of /; and /5. 


proposed in which the cathode and-anode are concentric toroi¢ 
the constant electric field is radial, and the magnetic field loo 
around the cathode azimuthally. The electrons travel along t 
cathode, at right angles to both electric and magnetic fiel 
The circuit wave which is excited by them follows the sar 
direction, and the anode is thus provided with transverse resona 
stubs. In all these anode structures, the coupling*?’ to t 
external waveguide or transmission line may be by probe, lox 
or slot excitation. 


(5.3) Electron and Circuit Interaction 


The interaction space of a magnetron contains a steady electt 
field between cathode and anode, a uniform axial magnetic fie! 
electrons emitted from the cathode, and an r.f. field betwe 
cathode and anode, consisting of a wave travelling slowly arou 
the anode structure and increasing rapidly in intensity as t 
latter is approached. ‘Fhe solution of this interaction proble 
is difficult, but-some progress has been made with analytic 
methods.*!9: 589 

The effect of a small r.f. electric field is to change the directi 
rather than the energy of the electrons. For oscillations 
start, the electric field must be large enough to give the electro 
a velocity equal to that of the r.f. wave. If the radial accelerati 
in eqn. (53) is made zero, then 


dV 2mv 9 
dr %(B oe) : es 


assuming the electron is travelling at velocity vy midway betwe 
cathode and anode. As a simplifying approximation let 


GV aila ( 
de Oa —r,) 
The condition of synchronism with the circuit wave gives 
af 
ON = = a +1). ..9 48 ee 


Substitution of eqns. (62) and (63) in (61) extends the conditi 
originally given by Posthumous>*® to give the well-known Hart: 


voltage: 
ate? S72) 1 
V_q =e 'p (1 -) ae 


A typical oscillation condition and value of Vj are shown 
Fig. 16. A nomogram for this threshold voltage has be 
given.47! 

Eqn. (64) represents the cut-off condition for the oscillati 
magnetron. Lines are plotted in Fig. 15 for various values 
n, the Hartree lines being tangents to the cut-off parabola. 
the anode potential is increased from low voltages, where curr 
cut-off obtains, oscillations occur at the N/2 line, restart at ’ 
(N/2) —1 line, and so on until the cut-off parabola is reach 
when the region of steady current prevents further oscillati 
These relations permit scaling®°* of a magnetron from one { 
quency to another. 

The exact mechanism of build-up of oscillations 
obscure.*94, 454 Experimental evidence*’> suggests that » 
dense space-charge cloud surrounding the cathode becon 
unstable and causes pre-oscillation before the threshold c 
dition of eqn. (64) is reached. The stability of this cloud | 
been the subject of much study.*5: 473: 478, 549 [Inclusion of ' 
effect of electron emission velocities predicts*°* a diffusion 
the cloud edges which is supported®?? by experiment. 
analysis>3?- 59!,599 of the pre-oscillating state, the adoption 
a statistical approach shows*5* 587 that peaks occur in the nc 
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etn These have been observed in practice,46! thus 
adi ting a smooth transition between the pre- and full- 
scillating states. Injection of a small signal enables4?® the 
acs of the oscillations to be controlled. 

‘The electron orbits under full oscillating conditions are 
ifficult to analyse, but in some instances numerical calculation 
as been made. Thus, Fig. 21 shows the orbits of the CV76 


4 
* 


A 


Fig. 21.—Electron bunching in a magnetron. 
The r.f. electric field at the anode causes phase focusing of the electrons. 


plots*®! four parameters: two associated with the input circuit 
and two with the output. Input voltage and current are plotted, 
respectively, as ordinates and abscissae, and on this plane, lines 
of constant magnetic field and constant power output and/or 
efficiency are drawn. ‘These curves are plotted for the mode of 
primary interest and for a fixed external load. If the overall 
efficiency remained constant, the power-output contours would 
be hyperbolae that approach the axes asymtotically. Practical 
charts bear some resemblance to the static characteristics of 
Fig. 16, as may be expected. An example*!> for an 18-stub 
rising-sun magnetron operating near 9-5 Gc/s is shown in Fig. 22. 
The variation of overall efficiency with current, for a constant 
field, is related to the extent to which the rotating space charge 
departs from synchronism with the r.f. field on the anode 
structure. In this particular tube, there is a zero-mode dip near 
3 500 gauss which is, however, below the operating field. 

The magnetron is well known as a generator of large pulsed 
power and the development of the original cavity type for 3 Gc/s 
has been recorded.39>: 525: 567; 568,569 Tubes were later con- 
structed*!>, 469, 624, 625 for frequencies in the range 1-24Gc/s 
with power outputs up to 1 MW peak. Further developments 
have been in the direction of longer life, more arduous duty 
cycles and improved frequency stability. Recent tubes*%> 544 
include a magnetron producing,°* at 1:31 Gc/s, a peak power 
of 10MW with 10microsec pulse duration and 17kKW mean 
power. The input was 66kV, 400amp, the magnetic field was 
1440 oersteds and the ring-strapped anode had 10 hole-and-slot 
resonators. 

The frequency characteristics of magnetrons have been the 
subject of much study.®2° 62! A limited range of frequency 
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Fig. 22.—Performance chart of a magnetron. 


Anode, 18-stub rising sun; frequency, 9-375 Gc/s; pulse length, 0-5 microsec; p.r.f., 500 c/s. 
The efficiency dip near 3 500 gauss should be noted. 


gnetron operating at a frequency of 3Gc/s. An electron at 
int A gains energy from the r.f. field and returns to the cathode, 
lile an electron at point B gives up energy and eventually 
ikes the anode. This bunching of electrons results in the 
ace-charge cloud taking on*®® °8? the spoke-line appearance 
own. 


(5.4) Practical Data 


The performance of a magnetron as a function of load impe- 
nce may be given in the form of a Rieke diagram. The output 
upling is normally adjusted so that a matched-load impedance 
es the optimum conditions for output and efficiency. The 
actical aspects may also be represented on a chart which 


tuning can be achieved as a result of modification*2* 54! of the 
slow-wave structure, for example by the simultaneous insertion*!> 
of conducting pins into the elementary resonators. Greater 
ranges are achievable with voltage tuning in conjunction with 
the use of a higher-order space harmonic :49: 552 such operation 
is facilitated by the adoption of interdigital structures.297 °° 
Rising-sun magnetrons are preferred>®?: *®3, 564 for generating 
high power at millimetre wavelengths.*7! An example®! for 
35 Gc/s gave 20kW with a pulling figure of 32 Mc/s: the input 
was 13kV, 10amp and the magnetic field of 10500 oersteds 
was below the zero-mode dip. Another tube®® gave 100kW 
peak, 40 watts mean power with a life of 200 hours. Using 
22-vane structures,*9? a power of 100kW at 48Gc/s has been 
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obtained with an input of 20kV, 25amp: another tube, for 
90 Ge/s, gave 10kW with an input of 10kV, 10 amp, a magnetic 
field of 26000 oersteds and a life of 100 hours. 

A different approach>”? to millimetre-wavelength operation 
has shown that magnetrons can oscillate with reduced efficiency 
at magnetic fields of about one-quarter of the conventional values. 
Typical minimum-voltage-regime tubes have covered*?’ the 
range 10-60Gc/s. Still lower voltages can be employed with 
spatial-harmonic operation.>/2, A typical anode can be thought 
of as having 12 uniformly spaced resonators, eight of which 
were omitted in two groups of four. Oscillations were obtained 
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amplitudes. A well-bunched beam results, in practice, in 
increase of radiated power by a factor of 107 or more an 
thus always employed. 

For a delta-function current pulse, the harmonic-cur 
amplitude, J,, is given by 2/). This enables high harmonic: 
the input frequency to be generated with the aid of suite 
circuits. For example, energy has been detected,>*3 correspo 
ing to the 12th to 65th harmonics, when an electron beam 
2MeV energy and bunched at 2-83Gc/s was passed axi 
along a 0-42in x 0:17in waveguide. Such harmonic 
quencies have also been achieved by the arrangement of Fig. 2 
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Fig. 23.—The harmodotron—a beam harmonic generator. 


Frequency 2-775 Gc/s; beam energy 1-1-5 MeV. 
The smaller cavity selects a high harmonic. 


by interaction between the electron stream and a component 
field having the same periodicity as that produced by the full 
number of resonators. At a frequency of 9Gc/s, a c.w. power 
of 75mW was generated with an input of 600 volts, 9mA and a 
magnetic field of 2000 oersteds. A low-field tube for 24Gc/s 
possessed*4” a rising-sun structure mechanically tuned with 
capacitive pins: at a fixed operating point, an output power 
exceeding 10 watts at an efficiency of 5% was obtained over a 
frequency band of 7%. 


(6) GENERATION OF ULTRAMICROWAVE FREQUENCIES 


(6.1) Radiation from High-Energy Beams 


Power at frequencies above 0:1 Gc/s may be obtained3®>: 38° 
by direct radiation from relativistic electron beams. If the 
beam is uniform, the contributions of individual electrons are 
random in phase and the energy radiated is equal to the sum of 
the energies of each electron. If the beam is suitably space- 
bunched, the radiation is coherent and it is the amplitude of 
the radiation field which is equal to the sum of the individual 


the harmodotron.*!*: °88 Electrons from a 20kV gun are 
passed through a pre-bunching cavity where they are give 
desired velocity distribution; they are then allowed to drift 
finally enter the accelerator cavity to emerge as a high-curr 
density tightly-bunched beam of small cross-section. The bi 
then passes through the harmonic-frequency cavity wl 
may be iris-loaded, as shown, or a plain-tunable type. | 
theoretical relationship for the generated power is typically 
the order of 

P2 = 2-704 f) x 10% 


With a 1 MeV, 5mA beam, bunched at 2-775 Ge/s, a powe 
1 watt at 36 Gc/s was obtained. The highest frequency obset 
was 95 Gc/s, corresponding to the 34th harmonic. 

It has been shown?3* 535,576 that three possible mechani 
of radiation from beams relate respectively to linear, sinuso 
and circular motion of the particles. Consider first the radia 
from an electron of velocity v, shown in Fig. 24(a), in mo 
along AA’, which is received at point P. As long as the velo 
component v cos @ in the direction AP is near the wave veloc 


Fig. 24.—Radiation from electron beams. 


(a) Acceleration in a straight line. 
(6) Transverse periodic motion. 
(c) Rotation in a circle. 


the radiation emitted later catches up with that emitted 
lier. The time taken by a spherical wave of potential, S,, 
mtred at P, to sweep over the charge is a measure of the 
ase of potential that is observed when this wave finally 
lapses on P. This time is proportional to 


Ee 1 
(v, —vcos@) ~ 1 — (v/v,) + H(e/v,)6? 
2 
a (9 + 6)? oh 


e A = (1 — v*/v2)'? = 1/x. The radiation is contained 
ithin a narrow cone centred on the direction of movement of 
ie charge, with angular opening 6 = 6. 

Such radiation was discovered at optical frequencies by 
enkoy*°’ and its origin was explained by Frank and Tamm.*8 
electron passing, for example, through a dielectric medium, 
luses polarization charges to be momentarily produced and 
e medium reradiates the transferred energy. A wavefront will 
rm, as in Fig. 25(a), in a direction @ such that radiation travels 
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vector normal to the surface represents radiated energy and is 
given by 


S = 4nF7*p307e-744n plc (70) 
where gq? = w*(1/v? — 1c) (71) 
d is the distance of the beam from the surface 
and NR = (1 S) (ae 1) [202 —- 1) (72) 
Further, S is given by 
y= NRLy(FAp)*e~ 294 (73) 


The quantity nrZ, can be interpreted as an intrinsic resistance 
for Cerenkov radiation in this geometry. For example, if 
Onc — 20 2 dee at Om tie) cl == Oro OL G/S8 Ag = 
10-!A/m, then, assuming F = 2 for perfect bunching, 


S=11 x 10-2 watt/m2 (74) 


which may be compared with the energy in the 10kV beam of 
1 kW/m. 
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Fig. 25.—Principle of Cerenkov radiation. 


(a) Radiated wave front. 


(c) Cylindrical tunnel. 


| 
q (b) Single flat surface. 
. 


om A to C. The particle velocity may be greater than the 
ave velocity, c/n, and thus 


. c[n =vcos8 . 


: (67) 
efines the conical shell of radiation. The thickness of this shell 
ecreases as the path of the particle in the medium increases. 
ince the particle does not lose velocity, the radiation is inde- 
endent of its mass. The power radiated is given by 


3 
7 a) Bee (68) 
here the integral is extended over all frequencies for which 
re Cy 

The generation of power at microwave frequencies by the 
erenkoy effect has been considered.38°: 433, 456, 457,547 The 
rrangement of Danos*?° requires the particles to move close 
) a dielectric surface. In the case of a flat beam, shown in 
ig. 25(b), the charge density was assumed to be given by 


P = poX(x)[1 + Fos (Bz — wr)] . (69) 


here F is the Fourier coefficient of the charge density corre- 
sonding to the frequency w. The component of the Poynting 


Suitable geometries for Cerenkov generators also include a 
flat beam sandwiched between two dielectric sheets, as shown 
in Fig. 25(c), a hollow beam in a cylindrical hole? in a dielectric, 
as in Fig. 25(d), and other arrangements.387: 593,539 The 
Cerenkov effect resembles in some respects the excitation®!® of 
surface waves, and corrugated-metal surfaces can function>** as 
well as a dielectric. 

Practical Cerenkov generators for microwave frequen- 
cies*#27) 494,501 have included the use of composite media’ 
and anisotropic ferrites.54© The arrangement of Fig. 26 
operated*?> at 24Gc/s. The radiation was excited by a flat 
electron beam passing closely to the dielectric material, which 
was 19cm long and made of polycrystalline TiO, with n = 10-2. 
The beam potential was 10kV, the current 0-2mA and the 
electrons were previously bunched by passage through a resonant 
cavity. The radiation contained the bunching frequency and 
its harmonics: the estimated power in the fundamental was 
10~7 watt. 

The second mechanism of direct radiation from beams depends 
upon perturbation of the electrons, which can be longitudinal*”® 
or, as shown in Fig. 24(4), transverse. Let S,, S; be two spheres 
of radii ct and c(t + Ar). If an electron takes time Ar to 
travel the spatial period J, from A to A’, the radiation emitted 
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Fig. 26.—Cerenkov microwave generator. 


Frequency, 24Gc/s. Refractive index of dielectric, 10:2. 
Output, 10-7 watt. 


during this interval is piled up in the shell between S, and $3. 
The thickness PP’ is 


v 
v,Ade(1 —~ cos @) = 40,63 + 62) (75) 
Up 
If the electron has carried out one period of sinusoidal motion 
between A and A’, the radiation will contain Fourier components 
with period At/x* or frequency 


2 2 


CK ms cK 
v,At =i Ip 


This maximum frequency is propagated along the centre of a 
cone and the radiation tapers off towards lower frequencies for 


f, max — (76) 


Table 3 


MINIMUM BEAM CURRENT DENSITIES 


Frequency, Tc/s 


of view in a time roQo/v,. One such pulse occurs during ei 
revolution and its thickness is 4v,A10% = 4r9/x?. Hence, 
harmonics of frequencies ranging from low values up to 


Tate a ck /ro . 
will be observed. 


(6.2) Higher-Frequency Limit of Coherent Generation | 
Methods of generating power at frequencies higher tila 


0-3Tc/s have employed the so-called mass radiators4?!; 459 wh); 
include black-body sources and spark-type oscillators.°>” ° 
Such radiation is incoherent: it resembles noise and cannot p}- 
duce independent interference phenomena. In fact, the out}i 
of all electron tubes contains some incoherent energy and theri 
evidence that, according to the type of tube, a higher-freque 
limit of coherent oscillation exists. 

The particular case of interaction between a bunched electri 
beam and the electromagnetic field in a resonant structure may}! 
analysed‘ by including the effect of thermal noise in the cav) 
but neglecting beam noise. The average r.f. beam curréf 


density must be such that 
J2 > 207 €y Woy OQ? V; | a 


where V; is the interaction volume of the cavity. Wy, 1) 
average signal energy stored in the resonant field, has a minimu} 
value governed by the requirement that it must exceed the no} 
energy. Thus 


Woy = heo||27(el?=*T, — 1) ees CG 
For a copper cavity with T = 300° K, the values of J; for typic 
microwave tubes at various frequencies are given in Table} 


Thus, for an r.f. beam intensity of 1 A/cm?, the frequency 
is about 10Tc/s. 


1000 


Current density, A/cm2 


larger angles. It will be seen that the observed frequencies are 
greater than the frequency corresponding to the spatial period, 
Io, because of the relativistic Doppler effect.>°” 

The electrons can be given a transverse motion by passage 
through an undulator—a device consisting of a succession of 
electric or magnetic fields with alternating polarity. The 
operating conditions for a magnetic undulator have been 
studied*!® 524 and a practical model has been constructed.°3¢ 
A bunched 5 MeV electron beam was provided by a linear 
accelerator, and, since the radiated intensity increases with the 
square of the magnetic field strength, powerful magnets with an 
array of steel teeth attached to the poles were employed. There 
were 25 pole pairs with a spacing of 4cm: the tolerance on the 
spacing was 0:02mm and that on the magnet flux 1%. Since 
the particles travelled at almost the velocity of light the radiated 
frequencies were given by the intersection of the v = c line in 
the Brillouin diagram of the periodic structure. In the experi- 
ments, small amounts of power, corresponding to 1 watt peak, 
were observed in the range 0-3-3 Tc/s. 

The radiation from a charged particle rotating on a circle of 
radius rg maybe examined with the aid of Fig. 24(c). While 
the particle has moved from A to A’, the cone of radiation of 
angular opening 6) = 1/« sweeps past the observer and is out 


2 x 103 2 x 106 


A further restriction on coherent oscillation is that the quantu! 
of energy, hw/27, which is 4 x 10~*eV at 0-1Te/s, becom 
comparable with the r.f. kinetic and thermal energies of electroz 
in the beam. Since energy exchange occurs only in discrej 
steps of one quantum, such effects modify both the amplitué 
and phase of the interaction between electrons and the electr 
magnetic field. 

In the particular case of interaction of an electron beam wi 
a resonant cavity, these quantum-mechanical effects cause th 
expectation value>’8:>79 of the electromagnetic field to diffe 
from the classicial value by a small amount which, in practic; 
means a small attenuation in signal. Of greater importance | 
the fact that there is an uncertainty or dispersion in this expecta 
tion value which represents noise in the output signal. Th 
physical origin of this dispersion lies in three sources. Th 
first is the quantization of the field>8° and is independent of th 
electron beam: it represents the uncertainty in the undisturbe 
electromagnetic field in the cavity resulting from its quantum 
mechanical properties. The second source of dispersion is i 
the wave properties of the electron: the fact that it is not a poin 
causes its interaction to produce an uncertainty in the cavit 
field. The third source, which is usually of negligible effect, is th 
classical interaction between the quantum-mechanical uncertaint 


art in the interaction with the field. Such effects were first 
idered>?° for the case of a classical electron and quantized 
and graphical data were given for various examples. This 
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(a) Plain oxide, internal heater. 
(b) Thorium oxide, internal heater. 
(c) Nickel mesh, internal heater. 


alysis was extended®? to the case of a quantized electron and 
ssical field, in which, assuming a gap voltage, V,, the 
abability of an electron gaining or losing n quanta is J2(q) 


pe: 27eV, sin 40, 
[lhe <b, 


eing the transit angle. In the limit of large number of 
ta transferred, this distribution tends to the classical value. 
practice, the increment in field due to a single electron is 
h smaller than the initial uncertainty in the field. If many 
ons are employed it is possible to define a ‘signal/noise 
io’ given by 


(80) 


to ohare, ola i 
re (137m Yd akp (81) 
e V, is the volume of the cavity. It is seen that this ratio 
ases with increase in frequency and increases with electric 
For example, if V,oc.A3, if the loaded Q-factor is 
S unity when the frequency is about 3Tc/s. The minimum 
ergy in eqn. (78) is then given by one quantum. 
ilar calculations***:°°3 confirm these predictions while 
urements on the interaction of a beam with a resonant 
jity have been reported. The transit time was an integral 
umber of r.f. periods, the resonator being excited at 24Gc/s 
ya magnetron. The beam voltage was 10kV and the beam 
ogle 0-004rad. The apparent noise temperature of the beam 
reased after transit and its dependence on the r.f. field indi- 
ed quantum effects. In similar experiments,**° using a trans- 
tse r.f. field, the fluctuations in the cavity gave a deflection 
ead of the electron beam. 


(7) ELECTRON SOURCES 
(7.1) Thermionic Cathodes 


Electron sources in microwave tubes are invariably thermionic 
athodes with peak emission densities up to 100 A/cm?. It is 
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difficult to combine such primary emission with useful life, but 
back bombardment due to returning electrons assists by causing 
the emission*’’: 488 of a large number of secondary electrons. 
Pure metal cathodes are rarely employed, since their emission 
densities are too low, although water-cooled metal cathodes have 
been used®°2 as pure secondary emitters. The plain-oxide 


cathode, shown in Fig. 27(a), employs a mixture of barium 
SINTERED THORIA Ni WIRE 
MESH OXIDE LAYER 


SLEEVE 


(2) HEATER © 
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Fig. 27.—Cathodes for microwave tubes. 


Ns Porous nickel, soldering-iron heater. 
e) L-type dispenser, internal heater. 
(f) Impregnated-tungsten dispenser, internal heater. 


and strontium carbonates supported on a nickel sleeve, and is 
satisfactory with pulsed emission densities up to 10 A/em?. The 
thorium-oxide cathode of Fig. 27(b) is usually employed in high- 
power tubes. Trouble due to arcing at high voltages may be 
reduced by packing the active material in the interstices of a 
woven-nickel mesh, as shown in Fig. 27(c). A variation of this 
cathode is that of Fig. 27(d), in which the oxides are held in a 
sintered matrix of coarse nickel powder. Such cathodes have 
been employed with success in the 3J50 magnetron operating at 
a frequency of 10Gc/s, and the 3J21 for 24 Ge/s. 

Recent developments have been associated with the so-called 
dispenser cathodes. One version, known as the L-cathode,>!! 
is shown in planar form in Fig. 27(e), from which it will be seen 
that the oxides are contained behind a porous tungsten emitting 
surface. In the impregnated cathode,>!? shown in Fig. 27(/), 
the alkaline-earth material, in the form of a mixture of normal 
and basic barium aluminates, is dispersed within the pores of 
the tungsten body, thus making for a simple construction. 
Cathodes for microwave tubes may employ a sintered mixture of 
thoria and molybdenum powders>*> and other materials.*?2 


(7.2) Focused Beams 


In grid-controlled tubes and magnetrons, the cathode, in 
conjunction with the positive electric field of the anode, is the 
source of the electron beam. More usually, the beam is emitted 
with a circular cross-section by an electron gun. Such guns 
exist in a number of forms°®” and one example,°**4 consisting of a 
concave cathode and a suitable beam-forming electrode, is shown 
in Fig. 28. 

The subsequent shape of the electron beam is determined by 
space-charge forces which are usually described by a parameter 
known as the perveance of the beam or gun. In some tubes, 
such as low-power reflex klystrons, the beam is short and the 
effect of divergence may be determined from the current flowing 
through a cylindrical tunnel. This current is a maximum when 
the minimum beam diameter is midway along the cylinder. If 
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Fig. 28.—Production of long focused electron beams. 


The electron beam is injected abruptly into the magnetic field of about 1000 oersteds. To save weight, a periodic magnetic field is employed. 


the tunnel has length /, and radius r,, the beam perveance has a 
maximum value of 


P ae = 9°75 X 10-6(4,]1,)? (82) 


The current density at the minimum cross-section is greater than 
that at the ends of the cylinder by a factor of 5:66. 

For a long electron beam, as required, say, in a travelling-wave 
tube, r, < /, and the limiting perveance is low. Positive-ion 
focusing is possible but has not found favour.°°? In practice, 
beams of high perveance are invariably obtained by magnetic 
focusing. In confined focusing,#9* a cathode of substantially 
the required beam diameter is immersed in an axial magnetic 
field which is made strong enough to restrict the transverse 
motion to an acceptable amount. In a more refined version, 
the electron paths in the gun region are designed to lie along the 
lines of magnetic field. 

In Brillouin>?? flow, the electrons, rotating about the beam 
axis, experience an inward force which just balances the outward 


one due to space-charge repulsion and centrifugal force. The 
beam radius, r,, is obtained from the relation 
a ex3i2 e Ber2y 1/2 
Ig = 560 =) Br} ( %—-— 2) Peter os 
The maximum perveance becomes 
a a Oca (84) 


the condition being such that the electrons all travel with the 
same axial velocity. This velocity corresponds to the potential, 
4Vo, on the axis of the beam, the remaining two-thirds of the 
total beam energy being rotational. In practice, much less 
intense beams, with Y= 1-2 x 10~*, are preferred. For 
example, a beam of 200mA at 3kV can be projected down a 
tunnel of radius 1-4mm and Iength 33cm with a loss to the 
wall of only imA. A typical method®°® of achieving Brillouin 
flow is shown in Fig. 28. 

Focusing magnetic fields may be produced by solenoids for 
experimental purposes or by permanent magnets for fixed 
operation. A field of 500 oersteds over a distance of 52in can, 
for example, be given*® by a pair of permanent magnets weighing 
31b. Eqn. (83) shows that the focusing effect depends upon H?, 
a fact which has given rise to periodic*”® >>> or ‘strong’ focusing. 
At every plane where the magnetic field changes sign, the direc- 
tion of electron rotation reverses but otherwise the beam is 
unaltered. The use of an array of N short magnets means that 
the stray magnetic field is considerably reduced,>27>>28 the gain 
factor lying between N and N?. Several practical assemblies 


have been developed4®?: 48; 409, 565 and one arrangement, em] 
ing a magnet of weight 1 1b 50z, is shown in Fig. 28. 

Periodic electrostatic focusing®?® has not found much prac 
application3®! because the beam perveances are lower. Ce 
fugal methods have been proposed*!®: 411, 413, 472 in which 
beam is set spinning at the gun region before entering the re 
of periodic electrostatic focusing fields. Unless a spi 
travelling circuit-wave or a collector which ‘unwinds’ the t 
is employed, the angular beam energy in these centrif 
systems is wasted. In another type of periodic electros 
focusing—slalom flow*!8: 419; 420__the electrons follow an ¢ 
potential in winding their way in sinuous fashion throug) 
array of conductors. 


Part 3. NOISE IN ELECTRON TUBES 


(8) OSCILLATORS 


The output signal from an electron tube is never a pe 
periodic wave: it is always slightly modified with ran 
fluctuations. This noise may be a function of the elect 
processes inside the tube, as well as the properties of 
structures associated with the microwave circuit. The 
output usually consists of a carrier, wide-band background 1 
and narrow-band modulation sidebands. The general source 
noise in electron tubes are well known:®7° they include 
noise, ®8!> 682 partition noise®3 and the modification du 
transit time.®7> 8 

Noise phenomena are important even when the tul 
employed as a generator.. Wide-band noise in the local oseil 
output of a superheterodyne receiver beats with the carri 
produce sidebands at the intermediate frequency. For a ty 
reflex klystron, the 723 A/B, the power density of noi 
2 x 10~!2 watt per Mc/s at 30 Mc/s separation from the ca 
Since this tube generates about 50mW of power, the noi 
this 1 Mc/s band is approximately 104dB. below the cz 
level. If the mixer of the receiver requires 1-5mW of 
oscillator power, such a tube would also supply 6 x 10~'4 
per Mc/s of noise. 

The performance of electron tubes as transmitters is influe 
by random fluctuations near the carrier frequency. These 
be considered to be made up of simultaneously existing pure 
and pure f.m. sidebands, with full correlation if both are cz 
by the same event, and zero correlation if they are not. 
usually f.m. noise which is of practical importance an 
causes have been studied.®88 Other theoretical and experim 
studies of noise have beenconcerned with klystrons,®>> 48: © 
magnetrons®3, 664, 666 and M-type carcinotrons.°43 


(9) AMPLIFIERS 


it is as low-level amplifiers that the noise characteristics of 
ron tubes have received most attention. Grid-controlled 
s°/2 may be used as such amplifiers, but their noise increases 
operating frequency. For example, the GL416B triode 
noise temperatures of 120°K and 1200°K at 100 Mc/s and 
s/s respectively. Klystrons show excess noise due to parti- 
. effects and conversion of thermal emission velocities to 
sity fluctuations, and are rarely used as low-level amplifiers. 
ood low-level performance is achieved with beam-type tubes 
, with care, only noise which is due to fluctuations in current 
ity and emission velocity predominates. The fluctuations 
sent at the input end of the electron beam excite space-charge 
and several theories treating the beam as a wave transmis- 
| system have been given.°37, 62, 680,682 More accurate 
are obtained®* 681, 85, 686, 697 by taking into account the 
oothing of the current fluctuations near the potential minimum. 
it was predicted®’?: °74 that the cathode noise varies periodically 
mg the beam. If /,,,, and J,,,, are the noise currents, then 


Dt 2 
Vrs | eal im 


ere A is a constant near ode es noise pattern is made 
two uncorrelated standing-wave patterns, due respectively 
he current and velocity fluctuations. If Z; and J, are their 
a, the minima being zero, then 


[rnin| |Lmax| cH Z| |Zol sin . (86) 


ey is the spatial phase angle. This analysis is supported by 
xperimental results®*? shown in Fig. 29(a). 


(85) 
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wer tube noise can be secured by putting the input end of 
slow-wave circuit at some critical-position along the beam, 
own in Fig. 29(b). A further improvement is obtained®* by 
loying drift tubes at different steady potentials. The result- 
changes in impedance can, by correct choice of length and 
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Fig. 29.—Noise on an electron beam. 

' Noise amplitude as a function of distance. 

(6) Noise reduction by optimum-length drift-tube. 
(c) Further noise reduction with short velocity jump. 
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position, result in de-amplification of the noise. This method is 
illustrated in Fig. 29(c), the potential of the first drift tube being 
jumped to that of the second at the point where the noise current 
is a minimum. This second tube is adjusted so that there is 
also a critical distance between the jump and helix. Some care 
must be used in designing velocity jumps which, being produced 
by sharp potential discontinuities, behave like strong electron 
lenses. Theory and experiment predict®® that such lenses tend 
to increase the noise in an electron beam as a consequence of 
the transformation of radial into axial velocity fluctuations. 

Estimates of the theoretical minimum noise figure for beam- 
type tubes have been made.®°!; 63, 654, 662,679 JF]. were zero, 
the noise figure of the tube would be unity. A higher noise 
figure exists°?© as a consequence of the lack of correlation 
between the initial noise current and velocity fluctuations of the 
beam. Making certain assumptions about the velocity and 
current fluctuations at the potential minimum and employing 
lossless beam transducers, a noise figure of 


Fr, = 1 4 '= 2) PT To’. (87) 
has been predicted.°7> For practical parameters, this gives 6 dB 
as the minimum noise figure for beam-type amplifiers. A rather 


better value is obtained by considering the conditions at the 
potential minimum. One analysis®°® gives results reasonably 
correct at the lower microwave frequencies, while computa- 
tions®!, 692 for a particular geometry give a noise figure which 
varies with frequency. 

A calculation procedure for low-noise travelling-wave tubes 
has shown®? that greater design flexibility is achieved by the 
use of three or more separate anodes. Jn such tubes, proper 
design of the electron gun is desirable,°°! with uniform emis- 
sion®* from the cathode. Tube noise may be affected by 
thermal-velocity spread,°°® emission of secondary electrons,®7! 
the focusing magnetic field®’” and partition effects.°7® Recent 
work®’7 has shown that an electron beam may be ‘cooled’ by 
certain processes. 

Information on the design of low-noise tubes has been 
given®>. 68,667 and measurements of their properties re- 
ported.®!, 699 These techniques have been incorporated in the 
construction of several practical tubes.®% 9° 669,684 Examples 
at 3Gc/s have given®42» 79 noise figures of 8dB, a typical gain 
being 16dB at a bandwidth of 100 Mc/s and beam current of 
400 4A. More recently,®*! a noise figure less than 4dB has 
been achieved. Low-noise tubes have also been developed for 
4 Gc/s®2; ©49 and 9 Gc/s.95 

The conditions have been studied for low-noise operation 
of transverse-field,°93 double-stream®®> and backward-wave? 
amplifiers. A general method of design for the last group, 
utilizing the circle diagram of impedance, has been presented :°4> 
it was shown that low-voltage tubes provide optimum noise- 
figure and tuning-range characteristics, conclusions which are 
verified in practice.°44 46 
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DISCUSSION ON THE 


| Dr. W. A. Gambling and Mr. B. G. Bosch (communicated): In 
juch an extensive review as Dr. Harvey’s it is perhaps a little 
infair to criticize any one part. However we feel that Section 8 
a little short and slightly misleading. For example, in tubes 
mploying long electron beams an appreciable amount of noise 
hay also be caused by the presence of positive ions. Sometimes 
‘hese manifest themselves as discrete oscillations which may be 
vasily observed, but often they cause an increase of the noise 
eer over a range of frequencies. Another point is that 

dulation noise is not necessarily restricted to a narrow band- 
width, but can occur over quite a broad band. 

No mention is made in the paper of noise in O-type backward- 
wave oscillators, and it may therefore be of interest to quote 
jome typical measurements which we have made at X-band 
lrequencies in order to illustrate the above comments. A useful 
figure of merit for some applications is the r.m.s. carrier fre- 
quency deviation, and for a C.S.F. CO43 O-type carcinotron 
under normal operating conditions the value obtained is 
54ke/s, with a correlation coefficient between the f.m. and 
1m. components of 0:35. For comparison, the corresponding 
igures for a CV323 klystron are 6-9kc/s and 0:57. The 
pandwidth of the system used in making these measurements 
was about 4 Mc/s. 

Noise sidebands of the same carcinotron and, again for com- 
Jarison, of a Varian klystron are shown in Figs. A and B. The 
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tig. A.—Total noise and a.m. plus background noise sidebands of 
O-type carcinotron CO43 at an operating frequency of 9:3 Ge/s. 


ipper curve in Fig. A gives the total (i.e. sum of a.m., f.m. and 
yackground) noise, while the lower curve shows the a.m. plus 
yackground component only. The difference between these is a 
neasure of the f.m. noise. The noise peaks at 0-67, 0-88 and 
-3 Mc/s are typical effects due to the presence of positive ions, 
ind the general increase in noise level over a broad frequency 
and is clearly shown. It may also be seen not only that f.m. 
loise exists near the carrier frequency, as implied in the paper, 
ut that, in, fact it predominates at sideband frequencies up to 
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Fig. B.— Upper and lower total noise sidebands of reflex klystron V58 
at an operating frequency of 9:3 Ge/s. 


(a) A mode centre. 
(6) At upper half-power point. 
(c) At lower half-power point. 


12Mc/s. A separate measurement at higher frequencies has not 
yet been made, but it is hoped to do this shortly. With local- 
oscillator tubes only the a.m. plus background noise is of interest, 
but with transmitting tubes it is the total noise which is of 
importance. For the klystron the upper and lower sidebands 
were measured separately, both at the mode centre and at the 
half-power points. As might be expected the carrier/noise 
power ratio is best at the mode centre. The results obtained 
with adjacent modes differed by only one or two decibels, the 
noise level increasing with mode number. It is interesting to 
note that in the range 20-60 Mc/s the noise levels of the klystron 
and the carcinotron are similar, although the power supplies 
for the latter were highly stabilized. 

To the already impressive bibliography we should like to add 
two more references concerning noise in backward-wave 
oscillators, which appeared before May, 1958. 
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Dr. A. F. Harvey (in reply): The term ‘narrow band’ was 
employed in the paper in a relative sense to distinguish noise due 
to modulation causes from that due to shot and similar effects 
which, making due allowance for any filtering action of the out- 
put microwave structure, is sensibly constant over an extremely 
wide range of frequency. Section 8 is necessarily brief because 
little has been published on modulation noise in electron tubes, 
and in this respect the contribution of Dr. Gambling and Mr. 
Bosch, reporting observations of effects due to ion oscillations and 
measurements extending beyond 10 Mc/s, is most welcome. 
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SUMMARY 

An optimum linear filter, in the mean-square-error sense, is no better 
than the optimum attenuator if there is no dissimilarity between the 
spectral densities of the signal and the noise. However, non-linear 
filters can use more statistical information about the signal and noise, 
so that, although they both possess the same spectral densities, a 
non-linear filter may be able to introduce a significant improvement 
in the mean-square error (i.e. it can do better than the optimum 
attenuator). For this reason the use of a non-linear filter in certain 
circumstances may well justify the greater difficulties encountered in 
its optimization and physical realization. 

The class of filters considered in the paper may be defined by a 
general expression relating output to input: 

R ao 
yQ) = X | o,7)6,[xC — 1) ]dr 
r=1Jo 

An almost routine procedure is proposed whereby the optimum set 
of weighting functions, w,(t), can be determined given either long 
enough samples of the combined input and the signal or sufficient 
statistical information about their characteristics. Some worked 
examples demonstrate that: 

(a) A significant improvement in mean-square error is possible even 
under the condition when the signal and noise possess the same 
spectral densities. 

(6) The class of non-linear filters under consideration can be 
optimized unhindered by the need to evaluate difficult integrals. 

(c) Although the physical complexity of the filter increases rapidly 
with the value of R, the mean-square error may converge rapidly to 
an asymptotic value as R is increased; in one example the performance 
was found to be within about 5% of the asymptotic value with R = 2. 


(1) INTRODUCTION 


In communication and control systems it often happens that 
information transmitted by the system becomes corrupted by 
the addition of an unwanted component called noise. The 
noise may be due to a number of causes such as crosstalk, dis- 
tortion or random disturbances. The relative magnitude of this 
effect depends on the system and its input. In some cases it is 
quite unimportant, in others the noise and signal may be so 
inextricably combined that the original signal information is 
veritably ‘lost’. As a measure of this effect we shall use the 
mean-square error: 


= Bue Seah b® —2)Pdt=[yoO—zor . Wd 


where y(t) is the system output and z(t) is the desired output, 
e.g. the signal component of input. 

If the input to a system is already corrupted by noise, it may 
be possible, by correct design, to arrange for the mean-square 
error at its output to be less than that at its input. Such a 
system is called a noise filter. It may be necessary for a system 
to predict future values of the signal from a knowledge of its 
past values, in which case it is called a predictor. The use of 
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linear devices for these purposes is well known, but, as she 
in the paper, non-linear devices can often give greater reduct 
in mean-square error. 

A filtering device must be able to identify and use some 
similarity between the statistical character of the noise ; 
signal components of itsinput. If the statistical description: 
the signal and noise characteristics are identical in all respe 
then no significant improvement in mean-square error is possi 
i.e. the optimum” filter becomes an attenuator. Furtherm<« 
if we choose to optimize a filter subject to the restrict 
that it belongs to a specified class of filters, we also rest 
the kind of information which the filter can identify ; 
use. Thus an optimum filter belonging to the linear class 
filters can produce a significant improvement in mean-squ 
error, only if the spectral densities of the signal and the n« 
are different, since it cannot use any other information. Ot 
dissimilarities in the characteristics of the signal and noise wh 
the linear filter has ignored, e.g. amplitude dissimilarities, 
be used by non-linear filters; thus a non-linear filter may 
able to produce a significant improvement in mean-square e1 
although a linear filter cannot do so. 


(2) SOME PHYSICALLY REALIZABLE FILTERS 


The following is a classification of some simple filters wh 
can be physically realized and optimized using the theoret 
method outlined in the paper. 


(2.1) Attenuator 


An attenuator is a devicet whose output at some instan 
equal to the sum of a term which is proportional to the inpu 
the same instant and a d.c. level term. (By this definition 
allow that the constant of proportionality may be greater tl 
unity.) 


Le: y(t) = kx(t) + K zo 1 ne 
(2.2) Zero-Memory Filter 

The output of a zero-memory filter is given by an instantane 
function of the input. The class of attenuators forms a sub-cl 
of the zero-memory class. 


H() = ffx(0] = | flee — 2]8@~)dr 
where 6(7) is the Dirac delta function. 


(2.3) Linear Filter 
The output of a linear filter, in general, can be expressed 


* We define ‘optimum filter’ as the filter which gives the lowest value of mean-sqi 
canis at its output subject to the usual restriction that it belongs to a specified ¢ 
ro) ters. 

t+ Although mathematically an attenuator is a special type of filter, and it 
improve the mean-square error, any reduction in mean-square error ‘that it ¢ 
produce is not considered to be significant for reasons explained in Appendix 8.3. 
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continuous linear summation of all past values of input in 
he form of the convolution integral 


co 


y(1) =| xc Mandi oA 


Vhere w(t) is the filter-weighting-function or delta-function 
esponse and x(Z) is the filter input. The class of attenuators 
orms a sub-class of the linear class. 


(2.4) Some Non-Linear Memory Filters 


A simple non-linear memory filter can be realized by connect- 
hg a zero-memory filter and a linear filter in cascade. Expres- 


Fig. 1.—Zero-memory filter followed by linear memory filter. 


jons relating output to input are given below for several 
anonical forms of memory filters. 


y(t) = | flee = Mlg@dt 2 so ©) 


see Fig. 1) 


R co 
WO= > [Eke DJo@dr . . . © 
r=0°0 


3oth eqns. (5) and (6) are special cases of the following equation 
Which defines a class of filters, ,: 


y(t) = | KL “gatialing Hoi becos @) 


_ Expanding the kernel function K[x, 7] in terms of a set of 
inearly independent or orthogonal functions 6,(7), 


Hees [ @b@-Dlo@dr 2. . @) 
r=0 °0 


By using a finite number of terms in the expansion, the filter 
san be physically realized (Fig. 2) and made to approximate to 


Fig. 2.—Multi-path memory filter. 


any filter defined by eqn. (7) with any desired accuracy, assuming 
hat the expansion converges. Filters for which the input and 
Jutput are related by an expression of the form of eqn. (7) are 
members of class 7;. The classes of linear and zero-memory 
ilters are both sub-classes of 7, since eqns. (3) and (4) are of the 
same form as eqn. (7). A filter of class 7; which is realized using 


only R parallel paths, each path representing a term of the series 
expansion, is called a multi-path filter of class n;r. The class 
N1R is a sub-class of 11, when N > Rand 7,y is a sub-class of 71. 
Still more general classes of non-linear memory filters can be 
defined! 2 by generalizing eqn. (7) and in other ways.?>4 


(3) OPTIMIZATION OF FILTERS OF CLASS nj 


The optimization procedure outlined below applies not only 
to filters for separating signal from noise with which it has 
become contaminated but also to the more general case in 
which the desired output depends on the input or some part of 
it in some linear or non-linear manner, as exemplified by a 
predictor or a demodulator. The term ‘filter’ is used in the 
general sense to cover all such cases, and, where necessary, the 
term ‘noise filter’ is used for the special application of separating 
signal from noise. 

The optimization procedure consists of minimizing the mean- 
square error defined by eqn. (1): 


ce eae {t 2d 1 
2 = tim 5 [x — e0Pat W) 


When the filter is optimized as a noise filter, the desired output, 
z(t), is equal to the signal component of input, and 


AO E120), OD] je 24 say O) 


where v(f) is the noise component of the input and g is a function 
of both z and v. Two examples will be mentioned: 

(a) In transmission of intelligence (by line, radio or other 
means) noise may be added linearly to the signal (e.g. atmospheric 
noise, thermal noise, shot noise, crosstalk, etc.); thus 


x(t) = z(t) + vo) 


where usually z(t) and 2(f) are statistically independent. 

(b) Also in transmission, a signal can become distorted due 
to non-linearities of the transmission system; in this case noise 
is added in the form of distortion components; thus 


x(t) = 2() + blz(@)] « (11) 


where h is a function of z, often instantaneous, containing no 
linear terms. In this case z(¢) and h[z(7)] usually possess some 
correlation. 

When the filter is optimized as a predictor z(t) is the desired 
output of the predictor and is normally expressed as a function 
of x@¢ + T): 


(10) 


2(t) = f[x(t + T)] (12) 
or, more commonly, 
a(t) = x(t + T) (13) 
For the class of filters 7, 
yO =| Kpe-d, dr 2 2. 
0 


and the mean-square error is 


i iy (oa) 2 
{\ K[x( — 2), T]dr — 0} dt 
10) 


1 

5: 
e* = lim — 
T->0 2T —T 


Changing the order of integration and expanding the integrand, 


2 =[" [° Kp@ ap nIKRO — 7p, ltr 
aA) 


— 2| K[x(t — 7), t]z()dr + 2(1)? (14) 
0 
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If x(7) and z(t) are members of ensembles X(t) and Z(¢) which 
are both ergodic, time averages may be replaced by ensemble 
averages, giving :* 


(ce) ioe) 
=| [| | Kee, WKGo, 72)pale1, x25 72 — Wdeideadr dry 
0 ~0 


ys, bei | Kix, Wepilx, 2: vidededs 


4 22 (15) 


where p(x1, >; T) is the second probability density of the process 
X(, and p,(x, z; 7) is the joint second probability density of the 
processes X(f) and Z(t). The minimization of this expression is 
achieved by using variational calculus. 

Let K(x, 7) be changed by an amount eH(x, 7), where « is a 
small parameter and H(x, 7) is an arbitrary kernel function 
subject only to the restriction that it must represent a physically 
realizable filter, i.e. H@, 7) = 0'7 <0: 

The variation in e? is given byt 


Ae = 
2¢| i J J K(x, 72), 71)P2%1, X23 T, — T2)dxdx2dT dTz 
0 40 


aX 2<{ J J H(x, Dap,(%,.75 Tdxdeds 


ae 2| I i fac, T JH (Xo, T2)Pa(%1, X23 T, — T2)dxdx2dT dT, 
0 40 


(16) 
By putting 0Ae?/d« = 0 when « = 0, 
f J Hx, “lf | K(x, T)P(X4, X23 7 =F T7)dX dT 
— | ZPo(%X, Z3 nde [desde =) (17) 


Also from eqn. (16) it can be shown that 02Ae?/d<? is always 
positive, so that the stationary value of 2 is a minimum. 
Using the fundamental lemma of variational calculus,° we obtain 
from eqn. (17) an integral equation which gives the implicit 
solution for the kernel function of the optimum non-linear filter: 


é | K(x2, 72)P2(X1, X23 T1 — T)dx_dT, = | ZP2(%1, 23 T,)dz 
(18) 


which must be satisfied for7, > Oonly, since H(x,,7,) = 07; <0. 
Thus the information which is required in order to optimize a 
filter of class 7, is the second probability density of the input 
P2(X1, X23 7, — T2) and the joint second probability density 
Po(x;, Z; T,) between the input and desired output. 


(3.1) Zero-Memory Filters 


The input-output relationship for a zero-memory filter is of 
the form 


yd) = f[x()] = Jt [x —7]8@dr. . | GB) 


ote Throughout the paper, (i) for each integral written without limits, it is implied that 
integration takes place over the whole range of the variable of integration, and (ii) all 
processes are assumed to be ergodic. 

Bigs) he first term in the expression for Ae? is the sum of two terms which can be com- 
bined, since p2(x1, x2; t2 — 1) is symmetrical in the pairs of variables (1, 71) and 
(%2, 72), ie. pa(x1, x23 T2 — T1) = Pa(x2, x13 1 — 72). 


Eqn. (3) shows that zero-memory filters are members of 
more general class 1; and the kernel function in eqn. (7) is 
the form 


K[x(t 7); tT] == fix 7)|5(7) 


Substitution in eqn. (17) for the kernel functions K(x, 73) 
f(x,)5(7>) and H(x;, 7;) = h(x,)6(7,) and subsequent integrati 
with respect to 7, and 7, leads to the equation | 


| hex)| fier, x23 Ode, — | apes, 23 Od |e =0 


whence, using the lemma of variational calculus, 


| Feeprbe, x23 Odx, = ferry, 230d. . G 


But Po(x1, x33 0), = p(xp)d(x, — x,) 


Therefore eqn. (19) can be rewritten 
£(x)p(x) = | #2 2d 


giving an explicit solution for f(x), namely 


J zDo(x, z)dz 
wre wr . . > e . (2 


where p,(x, z)dx6z is the probability that x lies in the range 
to x + dx and z lies in the range z to z + 6z simultaneously. 

Thus the optimum filter function f(x) is equal to the condition 
mean value of z, given x. The information needed in t 
optimization of a zero-memory filter is the whole informati 
content of the first probability density function of the input ai 
the second joint probability density function of the input a1 
desired output with 7 = 0. 


f{@) = 


(3.2) Multi-Path Zero-Memory Filters 


For many processes it is possible to obtain an expansion f 
P2(x, z; 7) in the form [see Appendix 8.1, eqn. (58)] 


Pals 257) = YY Damm POE) - 


where 0,(x) and y,,(z) are two sets of polynomials which a 
orthonormal with respect to p(x) and q(z) as weighting functior 
over the whole range of x and z, respectively, and p(x) and q( 
are first probability density functions of x and z, respectivel 
also 


Dam(2) = || O,(2)ym(2)PAee, 23 Ndxdz ss (5 


whence, by putting 7 = 0 and substituting in eqn. (20), 


fey = Ss Panby() | 2Ym(2)aC@)de 


But ' ZYn(Zq(z)dz =0, mA0;mA~1 
= G2) mm == 1 
=Z m=0 


owing to the orthogonality of y,,(z). 
Therefore f@)\ =>) D:, 8, @)ez+ 2) at See 
n=1 


since, as shown in Appendix 8.1, Dog = 1, A(x) =1 ar 
Don = Dro = 0,740. 


/| It is shown in Appendix 8.3 that truncation of eqn. (21) 
ft the Rth term yields the optimum zero-memory multi-path 


| Fig. 3.—Multi-path zero-memory filter of degree R. 


| The following recurrence relations, derived in Appendix 8.2, 
show that the polynomials 0,(x) can be determined from a 
knowledge of the set of moments of x up to and including the 
nth, and the coefficients ozD,,, may be determined from a know- 
ledge of the moments of x and the averaged cross-products 
between PianuMator Fl; 2% s705 


n—1 
= > Eee) 
8,(x) = = ie (80) 
Eu aap Pa | 
r=0 
and, from eqns. (79), (81) and (82), putting t = 0, 
NZ Ss x"0.(x)D,10, 
CPanel aa. (22) 
Ee ->¥ PI | 
r=0 


A finite number of coefficients D,,, [in eqn. (21)] and constants 
o, and Z can be determined experimentally by analogue measure- 
ments on long samples of the input, x(t), and desired output, z(z). 
The required computing equipment consists of: 

(a) A set of non-linear power-law-function generators, namely 


4 


xe, x2) x4, 2n 


Se 
(A convenient device which will generate all of these functions 
simultaneously is a servo-type multiplier with (2n + 1) potentio- 
meters connected to its shaft.) 

(b) An integrator. 

(c) A set of variable attenuators. 

(d) A set of adders. 

(e) A multiplier. 


The procedure is as follows: 


(i) Measure the set of moments (up to the 2vth) of the input 
process, X(t), by connecting the signal, x(7), to the inputs of the 
function generators and averaging their outputs in turn using 
the integrator. 

_ (ii) Determine the appropriate orthonormal polynomials up 
to the nth using the recurrence equation [eqn. (80)]. 

(iii) Measure the set of averaged cross-products x’z by 
averaging over time using the power-law generators, the multi- 
plier and integrator; then compute the coefficients from the 
recurrence equation [eqn. (22)]. 

Alternatively, set up the first n polynomial-function generators, 
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0,,(x), using the power-law generators, the attenuators and adders 
and measure the coefficients o,D,, directly using the multiplier 
and integrator. For an ergodic process we can express o,D,,; as 
a time average: 


Ue EEN ie {J 6(x)zPo(x, Zz; O)dxdz 


(igh 
= lim spl Gxleoar 


A block diagram of the measuring circuit is shown in Fig. 4. 


MULTIPLIER INTEGRATOR 


z(t) 


Fig. 4.—Experimental method of measuring the constants D,,. 


(3.3) Multi-Path Non-Linear- Memory Filters 


The expression for the output of the multi-path filter of Fig. 2 
can be obtained by substituting in eqn. (7) an expansion for the 
kernel function: 


K[x(@ — 7), 7] = XD &,(7)4,[x(¢ — 7)] (23) 
r=0 
Thus by substituting for the kernel function 
K[x2, T| Se 2 hr)0,G2) : (24) 


in eqn. (18) for the optimum non-linear filter of class n,, and at 
the same time substituting series expansions for p(x,, x2; 7) and 
P2(x1, Z; T) (see Appendix 8.1 and 8.2), namely 


Paris X25 7) = Sp 3 CaCO Om(2)PC)PC) + (83) 
Pari 237) = ES Dud OCedydAradace) « (58) 
we obtain 
EE Y Cpe) x 
r=0 n=0 m=0 
aml (71 a T2) wT 2)dr2 | 6,,,.(2)8.(x2)p(x2)dx > 
= FY Dur ve(xdey | ade. CS) 
k=0 I1=0 


™T1 20 
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But J 0,,(X2)0,(x2)p(x2)dx. =1 m=r 
=0 m-~r. 
and | zy(zq(z)dz =0 140;141 
== ey al 
paige, When) 


Using these results and equating coefficients of 6,(x,) for 
O0<n< Rin eqn. (25), 


R ioe} 
EJ Coml(rs — Tenlraddr = Durdez - 26) 
m= 
7™>0 I iad Wie 8). 6 
and when m =n = 0 
ioe) 
[wolraddr, = 2 (27) 


In general, the optimum kernel function can be expanded 
only as an infinite expansion [eqn. (24)], but in order that a 
solution for the optimum weighting functions w,,(7) can be 
found, truncation of eqn. (24) is necessary, thereby yielding a 
finite number, R, of simultaneous integral equations [eqn. (26)], 
in which, in general, the weighting functions w,,(7) for m< R 
will depend on R. 

It is shown in Appendix 8.3 that the weighting functions 
given by eqns. (26) are those of the optimum multi-path memory 
filter of degree R. 

The simultaneous integral equations [eqn. (26)] can be solved 
by the method of undetermined coefficients® (see Appendix 8.4). 
It seems that often there is a unique set of solutions, provided 
that additional restrictions on the weighting functions are 
imposed to ensure physical realizability in a sufficiently strict 
sense (see Appendix 8.4). However, if a set of solutions to 
eqns. (26) is not unique, then, at least, it can be shown to give a 
mean-square error which is no greater than for any other set 
(Appendix 8.3). 

The multi-path filter of class 7,2 can only approximate to a 
general filter of class ,, because for physical realizability the 
number of paths, R, must be finite. For a large value of R 
more information is required to optimize the filter until in the 
limit as R tends to infinity the whole information content of 
P2(X;, X2; 7) and a large part of the information in p,(x, z; 7) is 
required, i.e. all the coefficients C,,,,(7) in the infinite expansion 
of p2(x;, x2; 7) and D,,,(7) in the infinite expansion of p(x, z; 7). 

The coefficients C,,,(7) and o,D,,(7) can be determined 
analytically as described in Appendix 8.2 given sufficient 
statistical information concerning the input and desired output, 
or they can be determined. experimentally by making analogue 
measurements on long samples of the input, x(f), and desired 
output, z(f). The required computing equipment is the same 
as that listed in Section 3.2 except that a cross-correlator is 
required in place of the multiplier. A very suitable cross- 
correlator for this purpose is described by Lampard.’ The 
procedure for making the measurements is as follows: 

(a) Measure the set of moments of the input signal, x(f), up 
to and including the 27nth moment, using the power-law-function 
generators and the integrator. 

(6) Determine the appropriate orthonormal polynomials up 
to the mth using eqn. (80). 

(c) Set up the first » polynomial-function generators, 6,,(x), 
using the power-law generators, the attenuators and adders. 

(d) Measure the auto- and cross-correlation functions between 
the outputs of the function generators taking two at a time. 


OF A CLASS OF NON-LINEAR FILTERS 


These functions will be the coefficients C,,,,(7), since, fr 
eqn. (84) for an ergodic process, 


Com) = {Onl On(2)Paxle X25 yg 


1 T 
= lim 57 J bone + dee 


(e) Measure the cross-correlation function between each o 
put of the function generators and the desired filter output, z 
These correlation functions will be the coefficients o,D,, 
since, from eqn. (59) for an ergodic process, 


o-Dmi(7) = | | O,()2pxax, 25 Ddede 


— iin 
I> « 2 


1) 8, 6,[x(]z(t + r)dt . ( 
n#~O0 


The measuring circuit is shown in Fig. 5. 


=| 


CROSS-CORRELATOR 


Fig. 5.—Experimental method of measuring the coefficients Cri 
and Dryi(t). 


Although for many cases of input and desired output a n 
linear filter may be capable of giving considerably smaller me 
square filtering error than can an optimum linear filter, there 
situations in which the performance of an optimum non-lin 
filter is no better than the optimum linear filter. 


The Gia) =0 whenn=1 m1 


and m=1° nsH~d , 
and Dyn fT) = 9 whenm=1 n-1 
qc (0) 
eqn. (26) can be satisfied by putting 
Ot) =0 . m=253 hs 
and f¢ Cy(t, — T2)w(72)dT2 = Di(7)o, . « | 
Te oe 


Therefore w,(72) is given by the solution of the Wiener—H 
integral equation [eqn. (31)], in which 


C,\(7) = (Posi, } 
D,\(7) = Px2(T) 


* These conditions define a class of probability density functions p2(x1, x2 T) 
p2(x, z, t). This class is not empty and some of its properties are discusse 
Rorereaies 8, 9 and 10. 


( 


ire the normalized correlation coefficients. Although we do not 
jrove that this solution is unique, it does give a value of mean- 
quare error which is as small as for any other solution 
Appendix 8.3). Thus the optimum filter of class n, can be a 
inear filter, and the only information which is needed for its 
Binization is the auto-correlation of the input and the cross- 
orrelation of the input and desired output. 


(4) SOME NOJSE-FILTER EXAMPLES 

Two examples are worked out to illustrate the method. In 
he first example the signal and noise can have the same spectral 
lensity, but since only a zero-memory filter is optimized the 
pectral density (or auto-correlation coefficient) is not required; 
or the case in question it is found possible to determine the 
xact zero-memory-filter function from eqn. (20), and for com- 
yarison the multi-path zero memory filter is also derived, so 
hat some indication of the degree of approximation is obtained. 
n the second example, a multi-path memory filter is optimized 
or the condition when the signal and noise have the same 
pectral density, a significant improvement in mean-square error 
cing achieved. 

Whilst examples are worked out from statistical data, it may 
lappen that not enough information about the statistical proper- 
ies of the signal and noise is available to enable the filter to be 
ptimized ; however, if long enough samples of signal and noise 
re available, then, using the analogue measuring technique 
lescribed earlier, sufficient information can be obtained in a 
uitable form to allow direct optimization of a non-linear filter 
f class 7;r. Whilst there may be practical difficulties involved 
n obtaining sufficient accuracy of measurement, it is considered 
hat a computer of comparable accuracy to that of Pace* should 
e adequate to enable these analogue measurements to be 
nade. 

Under certain circumstances it may be possible to find a 
impler and better non-linear filter belonging to some class 
ther than 7,, in which event it would obviously not be 
esirable to use the optimization procedure proposed here. 
‘though the proposed method does not always produce the 
implest solution, it can be used for almost any condition of 
yut and desired output and will always produce a physically 
ealizable filter which is better than the best linear filter, except 
1 the special circumstances [eqn. (30)] when the optimum filter 
f class 1, becomes linear. 

In all the following examples we shall optimize no more than 
he first five paths of the multi-path filter. 

For the first example we take an input consisting of a linear 
ddition of signal and noise, where the signal is a random 
quare wave which can have two values +K or —K, each occur- 
ing with equal probability (i.e. the signal has zero mean and 
ariance K2). 


pz) = 3[5 — K) + 62 + K)] (33) 
The noise is Gaussian with variance o* and zero mean. 
1 vw 
pv) = (Mame. Oh (34) 


The optimum zero-memory-filter function is found first and 
men the multi-path zero-memory filter is optimized and the 
esults are compared. 

Since the noise and signal are independent and added linearly, 


re z)p{z ) 


o(z — 
= ayena 

* Precision Analogue Computing Equipment. 
VoL. 107, PART C. 


po(x, z) = oe 


K) + 8+ Ke 
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The optimum zero-memory filter is given by eqn. (20); hence 
K 


na [ 4 Ee 218 — K) + 8 + Ke de 
create K) + 8(¢ + K)le~ “52 dz 
2 a re ey 
Ai epee + paces 
= Ktanh 53 Ma lie Plies ths, Wet ees, pee) 


As K/o increases the error decreases. For small values of 
K/o there is a high probability that x will be negative for a 
positive value of z, such a condition giving rise to large error 
(see Fig. 6). 


Teac 


0-707 


4 


Fig. 6.—Graphs of K tanh Kx/o2 and the optimum multi-path 
zero-memory-filter function for three different values of K/c. 
K24+o02=1 


The multi-path filter may be optimized as follows: 
The first moments of a Gaussian signal with zero mean and 
variance o” are given by 
v=o" xX1x3x 
v=0 nodd 
The first moments of a random square wave with amplitude K 


are z” = K" for n even and z” = 0 for n odd. For simplicity 
we will let K? = o* = 4s0 that the variance of their sum is unity. 
Using eqn. (73) Conca 8.2) the moments are found from 
x(t) = z(t) + v(t) to be 


AE res 4k Vue 


Sy xe 2 — 1) even 
. (36) 


whereas odd moments are zero. 
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Using the recurrence formula 80 [eqn. (80), Appendix 8.2] 
the polynomials are found to be 


A(x) = 1 } 
6,(x) =X 
— 1 
O4(x) = ap 
(5) 
Xe = eS 
03(x) = a 
: 14 13 eo 
(Nes ree oe 
OG 3 xX 6 
84(x) 53, 1/2 
G 
96 233 
Sees ke 
, ie Ty 
Bhs 1522, 12 
=e) 


The coefficients D,, are then found using eqn. (81) in 
Appendix 8.2 for m = 1, 3 and 5: 


1 eons jane remade Ei 
a/2 4/26 31 4/134/761 
To compare this result with that obtained earlier [eqn. (35)] the 
polynomial represented by the expression 


Y = [Dy9,%) + D3193(x) + D5105(x)]o, 


is worked out and plotted on the same axes as the curve 
y = K tanh Kx/o* when K = 1/4/2 and o? = 4 (see Fig. 6). For 


these values of K and o”, 


y = 0-798 3x — 0-1643x3 + 0-011 83x5 (40) 


Since in this case K/o (= 1) is relatively small, the optimum 
zero-memory filter is far from perfect. The performance index 
(Appendix 8.3) has been worked out for the multi-path zero- 
memory filter for 1, 3 and 5 paths (Fig. 7) using eqns. (105) 
and (113). 

The curve of Fig. 7 suggests that an increase in the number of 


Dia Daas (38) 
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X 
ae ee 
K/o= 1 
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0-7 
0-6 x 
Ir K/o=2 
ov 
RS 0-5 
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0-4 
03 
x 
02 
K/o= st 
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0) | Se ee 
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NUMBER OF PATHS 


Fig. 7.—Performance index for the first problem. 


paths would not reduce the performance index below abo 
0:90 when K/o = 1. Fig. 7 also shows the performance ind 
plotted for two other values of K/o. The polynomial filt 
functions for these cases are given below and plotted in Fig. 


K/o=2 y=1:5530x — 0-641 1x3 + 0-08759x> . @ 
KJo=4  y =1:8377x — 1-071 6x? + 0:2378 .  @ 


It is to be expected that very much better performance y 
be obtained for larger values of K/c. 

This first example is almost trivial because the exact optimu 
zero-memory filter can be found using eqn. (20), and hen 
there is no advantage in optimizing the multi-path zero-meme 
filter in this case. The problem has been worked out primar 
to give some indication of the nearness of the approximati 
(see Fig. 6) obtained by using only a few paths. 

For the second case-of-signal and noise the optimum mul 
path filters, both-with and without memory, will be four 
Consider a noise v(t) which is Gaussian (of zero mean and wu: 
variance) and a signal which is obtained by passing Gaussi 
noise through a non-linear-function generator of the form 


H[s()] _ s? —1 
V2 af 2int 


where s(f) is Gaussian noise of zero mean and unit variance t 
uncorrelated with v(f), and H, is the second-order Herm 
polynomial. 

From eqn. (125) it can be shown that z(f) has zero mean a 
unit variance and that if the correlation coefficient of s(t) 
exponential so also is that of z(f).* The reason for choosi 
this method of generating the signal lies in the lat 
property of non-linear Hermite-polynomial generators w. 
respect to a Gaussian input. Since the correlation coefficic 
of z(t) is both known and simple, we can choose the cori 
lation coefficient of v(f) to be equal to that of z(f), with 1 
object of making a comparison between the optimum non-line 
filter and the optimum linear filter, which for this case becon 
a simple attenuator. Unfortunately, optimization of a ze 
memory filter using eqn. (20) leads to integrals which are r 
easy to evaluate (except by numerical methods); however, t 
difficulty does not prohibit us from finding the optimum mul 
path zero-memory filter by the method outlined in Appendix 8 

The moments of Gaussian noise, v(t), with zero-mean and u 
variance, are 


(aise (13 


om=1xX3x5x...(n—1) neven 
=) n odd 


and the moments of z(f) (as calculated in Appendix 8:5) are . 


z7=0 2=1 73 


2:828427 z4=15 } 


2D = 96-166522 7° =755 Zz! = 6983-386571 -¢ 
z= 74417 2° = 897799:3379 z!9 = 12096873 


The moments of x(t) are obtained, givent 


i) Ochs 


by using eqn. (73), Appendix 8.2: 
6 x° 


ma 0 = 1 te 22 x 1350 . 
x? = 822 x8 = 6202 x9 = 52552 x! = 499194 
* Both the second probability density functions of the signal and noise are of ¢ 


A. (See Appendix 8.1 and 8.5, and Reference 11.) 
{+ +2 is introduced here for convenience so that x2 = 1. 
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formula [eqn. (80), Appendix 8.2] giving 
: 


“Oalx) ~ ) 


By) = 

|) = 40? — x — 1) 

| 1 es) ‘ 2 . . 

030) S067 OF ~ 3° 75x? — 2-25x + 2-75) 

| 1 i (46) 
1p. eee Ee 3 , 2 

Pa) aE 64G9 — 8204 1x? + 5-2653x 

|= + 21-9592x — 3-061 2) 

lig) — 1 She fi 4 F 3 

950) = iascg7g5"° — 14-4628x* + 40-082 6x 


Diy = 0-707 1 

| Ds, = — 0-0875 GE 
Ds, =0-0617 

'The final polynomial then becomes 

y = Dy9,(x) + Dy 8X) +... Ds195(%) 

Therefore y = 0-0005x> — 0-008 7x* + 0-018 2x3 

+ 0-247 8x? + 0:4774x — 0-2246 (48) 


This function has been plotted in Fig. 8. The optimum multi- 
‘path memory filter can be found by calculating the set of cross- 
and auto-correlation coefficients between pairs of all the outputs 
of the non-linear function generators, 6,(x), and the desired 
output, z(f). The method outlined in Appendix 8.2, although 
of routine nature, does take a considerable time. The results 
are quoted here for a maximum of only three polynomial 
generators.* 


Cu) = pl) Cyl) = Cul) = 0 
ez) =-0-25[p(r) + 3p°(r)] 


0:75 
C2307) = Cx) = 5g ql— PO + 0°) 


—1-3460p(r) + 1-346 0p%(r) 
49-25 

0-5625p(z) + 15-1875p%(r) + 21p%(7) 

36°75 


C24(7) = Cy2(7) = 


me )= 


1-008 89(r) — 26-752 2*(r) + 25-676 1p3(7) 


C34(7) = Cy3(7) = 149-2817 


1-809 7p(7) + 71-1243p7(7) + 252-5364p3(r) + 282-087 5p4(7) 


where the input auto-correlation coefficient p(t) = e~*l"l. 
Eqns. (26) are then solved by the method of undetermined 
coefficients (Appendix 8.4) to give the weighting functions 
w,{T). 

For only two parallel paths (it so happens that the two integral 
equations are independent in this case), 


cw4(7) = 0-707 18(r) 
(7) = 0-276 18(7) + 0-222 2Be- 119526 
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Fig. 8.—Zero-memory-filter function for the second problem. 


OA camera Nee ait fe 4 (49) 


Csa(7) = 606-3906 


D,,(7) = 0-707 1p(7) 
D3,(t) = 0-353 6p(7) 
D3,(t) = — 0-08 75p(7) 
D4y(t) = — 0-038 6p(7) 
Ds,(7) = 0:061 7p(7) 
* The input signal, x(*), has a second probability density function for which 
Cin(t) = Cn(z) = 0 [see eqn. (30)] although Dni(t) 4 0, so that the optimum filter is 


not linear, but the integral equation for «(t) is independent of the rest of the integral 
€quations, 


(50) 


02768(7,) +0222 p68 7 


Fig. 9.—Optimum two-path memory filter for the second problem. 
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For three parallel paths, 
w(t) = 0-707 16(7) 
w(t) = 0-276 68(r) + 0-219 SBe~1-71286< 
+ 0:0008Be—2-37'70= +> 0 
=0 7<0 
— 0:06248(7) — 0-075 8Be—!-71288< 
P 0-012 6Be—2-37178 T > 0 
= ()) g@Zy 


i (2) 


w;(T) = 


J 


The block diagrams of these filters are shown in Figs. 9 and 10. 
The performance index is calculated (Fig. 11) using eqns. (105) 
and (113) for the zero-memory, filter and eqns. (110) and (113) 


for the memory filter. 


02778( +) +0219BE 
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Fig. 10.—Optimum three-path memory filter for the second problem. 
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Fig. 11.—Performance index for the second problem. 


The curve of Fig. 11 suggests that the performance index 
cannot be reduced below about 0-72 using a zero-memory filter 
or 0:70 using a memory filter. These results, whilst not perfect, 
should be compared with the figure of unity for the best linear 
filter. It is interesting to note that, by using only one zero- 
memory non-linear filter path, 0,(x), considerable improvement 
is achieved as compared with the linear filter, and little additional 
improvement is effected by increasing the number of paths or 
by using a memory filter. Thus, in this example it seems that 
fairly rapid convergence to the optimum filter of class 7, is 
obtained; this condition, whilst common, may not necessarily 
occur with other types of input. 


(5S) CONCLUSIONS 


A routine procedure, unhindered by the need to evaluate 
difficult integrals, is proposed for the optimization of a class of 
memory filters which are realized as a number of parallel paths. 
The physical complexity of the filter increases rapidly with the 
number of paths, but for many applications the performance 
index of the filter converges rapidly to a constant value as the 


number of paths increases. Although the analytical optimi 
tion is tedious, applications can be visualized where the labo 
and complexity might be justified; in particular, it has 
demonstrated that, if signal and noise have the same spec 
density, although a linear noise filter cannot produce a significa: 
improvement in mean-square error, a multi-path noise filter 
do so. 

Multi-path zero-memory filters also can be optimized usi 
this procedure, which has advantages over the more dir 
method, since, although an optimum zero-memory filter functic 
can be expressed explicitly in terms of probability functior 
[eqn. (20)], this expression involves integrals which often mé 
be difficult to evaluate. In contrast to the labour involved i 
the optimization of the multi-path memory filter, a multi-pat 
zero-memory filter can be optimized very easily analytical! 
(taking about four hours for an average problem). 

It is probable, in practice, that insufficient information abot 
the input will be available and it will be necessary to obtain th 
information from long samples of the input and desired outpu 
A method is proposed whereby the information can be obtaine 
in the required form by making some measurements using a 
analogue computer. Although no results have been obtaine 
in this way, no new experimental techniques are involved an 
the only essential requirement is a sufficiently large and accurat 
analogue machine. 

It may be necessary to employ a digital computer as well fc 
the solution of the set of integral equations [eqn. (26)]. TE 
example in the text was calculated laboriously by hand, but thet 
seems no reason why a digital machine could not be programme 
to do the same task in the same way; if it should be require 
to solve more than two simultaneous integral equations, a digit 
computer would be almost indispensable. 

Multi-path memory filters may be connected in cascade t 
produce still lower mean-square error. If an analogue com 
puter is available, a second multi-path memory filter can t 
optimized to operate on the output of an earlier filter; the mear 
square error generally will be reduced, because the combine 
filter is of higher classification!:? than a filter of class ;, an 
can use more statistical information about the signal and nois 
As the number of optimum cascaded filters is increased, th 
mean-square filtering error will converge to either zero or som 
finite value which must be Jess than, or equal to, the origin: 
mean-square error at the input to the cascaded chain. Thus th 
last of a large number of filters in the chain will not make a 
appreciable difference to the mean-square error; intuitively, 
seems that this filter will become a direct connection, whence 
in eqns. (26) and (27), ; 


Wy(7) = 0 m= Ii 
= 0,0(7%). 9m =A (5: 
= VOM>) 70 


Therefore the coefficients of the probability-density expansio 
for the output and desired output satisfy 


a, G1 (7) = Dai@e-r 
ye 


(5¢ 


If the input and desired output are such that they satisfy thi 
equation, the optimum filter of class 7; is an attenuator. 

A cascade connection of two optimum zero-memory filter 
is itself a zero-memory filter, so that it is to be expected tha 
the second of the two filters will be a direct connection, sinc 
together they cannot produce a lower mean-square error than th 
first alone can achieve. However, we can cascade a zero-memor 
non-linear filter and a linear memory filter alternately an 
produce successively lower values of mean-square error. Suc! 


i 
| 


ie arrangement may have considerable practical advantages 
| wing to the relative simplicity of the optimization procedure 
or both zero-memory and linear memory filters. 
i The optimization procedure proposed in the paper can be used 
_|for the analysis of non-linear systems. Given a non-linear box 
Jof class 7, it is always possible to construct an approximate 
multi-path filter of class 7;r (approximate only in the sense that 
Ris finite). Experimentally, the analysis of the box can be 
| made by measuring the cross-correlations and auto-correlation 
7 lof all the outputs of a set of Hermite-polynomial generators and 
the non-linear box output, having connected a common Gaussian 
input to them all (see Fig. 12). A multi-path filter is then 


NON-LINEAR BOX 


CROSS—CORRELATOR 


Fig. 12.—Experimental analysis of a non-linear system. 


‘optimized to have an output as nearly as possible equal, in the 
| mean-square-error sense, to the output of the non-linear box. 
|The reason for the choice of Gaussian input and Hermite 
polynomials is that this produces integral equations [eqn. (26)] 
|which are independent, thereby simplifying the mathematics, 
‘and also that a wide-band Gaussian input is sufficiently 
‘random for the unknown box to be completely characterized 
by the above measurements. Even if the unknown box is 
not of class 7; an approximate equivalent filter of class 1p 
can be found, and may be useful in the approximate analysis 
of more complicated non-linear systems. In this case the 
| equivalence does not necessarily hold for different inputs and it 
'may be better to find the equivalent filter under the same input 
conditions as those of the original non-linear box. 
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(8) APPENDICES 
(8.1) An Expansion of Second Probability Density Functions 


Let x(t) and z(t) be random functions of time belonging to 
the ensembles X and Z for which the first probability density 
functions p(x) and q(z) are known. 

Suppose there exist two sets of polynomials 6,(x) and y,,(z) 
which are orthonormal with respect to p(x) and q(z) respec- 
tively as weighting functions, over the whole range of x and z 
respectively : 


| 0,(x)0,(x)p(x)dx == 83 | 


(55) 
[v@v@a@ae = 6; | 
where 6, =1 i=) 
=0 iF-j 
The first few of these polynomials are easily identified, since 
J p@)dx = 1 
=f xp(x)dx (56) 
and o% =f (x — X)*p(x)dx | 


where x is the mean value of x(f) and o2 is the variance of x(f). 
Thence we may write 


fa x1 x p(xdx =1 


ND 
x =) p(x)dx = 1 


and we conclude that 


Oo(x) = 1 


O,(x) = = Es (57) 


and similarly yo(z) = 1 yi(z) = 
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The higher-order polynomials are all obtained from the moments 
of the distributions p(x) and q(z) (see Appendix 8.2). 
The joint second probability density p(x, z; 7) may be expanded 
in series form: 
ioe) eo 
pox, Z; T) I; y, x, Dam(T)O nV m(ZP~)a(z) (58) 
n=0m= 
The restrictions on p,(x, z; 7) to validate this formal procedure 


will not be enumerated. 
Because of the orthogonality of 0,(x) and y,,(z), 


Dam(t) = [| On()ym(2)P2(ee, 25 7)dedz (59) 


The coefficients D,,,,, can all be expressed in terms of the joint 
moments of the form 


ar(t) = #2 = [| x'25py(o, 2; adxde (60) 
and we observe from eqns. (57) and (59) that 
Doo oS 1 Don = nd — 0 nF 0 (61) 
(x —X\(z—-—Z 
DT = ~ = Pxz(T) (62) 


which is the normalized correlation function between x(#) and 
z(t). Certain other properties of these coefficients follow 
immediately. 


1D} 5Ga) a qaian (63) 


D am O) = Snag 


provided that x and z are members of the same ensemble. 


(8.1.1) Single-Series Expansion. 

There exists a class of second probability density functions,!! 
A, of physically realizable processes for which it is possible to 
expand the density function in the form of a single series: 

feo) 
Px, 237) = De Din 7)9(X)ynlZP(~)a(Z) - (64) 
= 


An example is the Gaussian second probability density func- 
tion with zero mean value, ¥ = 0, for which we may write: 


x? + x3 — 2p _ 65) 


1 
Po(%1; %23 7) = InV/C — p?) exp | 21 — p?) 


It can be shown in this case!” that 


exp — (4438) 


2 20 n 
p(X}, X37) = = DAC . (66) 
where H,(x) = (— rere Ze R, (67) 
and J H, (OH (xe7*dx = nhh/Qn) n=m 
dF } (68) 
= nF-m 
Therefore C2) — vag 
Dh ear (69) 
DE) — Von [2 
and Dry) ~~ p"(7) 
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(8.2) Analytical Determination of the Coefficients of a 
Second Probability Density-Function Expansion 


If both the first and second probability density functions 
a process or the first and joint second probability density functi 
of a pair of processes are known, it is usually possible to obtaii 
both the coefficients and the polynomials of the expansion 
that second probability density function or joint second proba 
bility density function as follows. 


(8.2.1) Moments of the Input. 


To determine the appropriate polynomials for a given proces 
(i.e. polynomials which are orthonormal with respect to the firs 
probability density function of the process as weighting functior 
it is necessary first to determine the moments of that proces: 
These can be found from the characteristic function (wu) of th 
random process, which is defined as 


bu) = Ee) = f e*p(x)dx (1 
Thus 7(u) is the Fourier transform of p(x). 
From the characteristic function the moments of the proces 
may be determined using the following formula for the at 
moment: 


Op = X= y-| S400 | 7 


a= 


(a) In filtering problems the filter input often consists of th 
sum of a signal z(r) and noise v(f): 


x(t) = 2() + vf). (7: 
whence Cy =X =(+ zy" 
= Scr (7 
r=0 


when z(f) and 2(f) are statistically independent. 
(b) The input may be equal to the product of signal and nois 


x(t) = z(f)v(1) (7: 
In this case the moments of the input are found more easil 
since 
x" = Zhigh = zy (7: 
when z(t) and 2(f) are statistically independent. 


(8.2.2) Orthonormal Polynomials, 
An orthonormal polynomial of order 7 will consist of a line 
summation of the first m of the linearly independent functio1 
1 Pes ame eA Sst» 
Putting 
such that 


Ao(x) = 1 and O,(x) = ayyx — ayo 


$062) (pCedatx = [@ux — ayo)p(x)dx = 0 


and [62@x)p@a)de = [(quix — a40)?p@)dx = 1 


we find that Qi = 1/0, and 49> X/o,. 


Therefore deal 


A(x) = (7 


Ox 


Let O4(x) = Ag2x? — az,8,(x) — ago 


such that 
Pearce = | fazax? = @19,(x) — ago]6,(x)p(x)dx = 0 


Mio 0oCe)pbehe = [lasnr? — 20460) — amlpeyax —0 


x? — x76,00)0,(x) — x? 


. ©-flaor-er- 
For the general case 
OX) = dyn X” — of yr) . (78) 
where Qnr = ee 
~1/2 (79) 
| am = {= —"S! ULC 
| —"S) £,650,00 
_ Therefore PC — — (80) 


[Erman 


(8.2.3) Coefficients of the Second Probability Density-Function 
Expansions. 
Using eqns. (79) we can obtain a recurrence formula for the 
coefficients D,,,(7) occurring in the expansion of the joint second 
probability density p(x, z; 7) as follows: 


o,D,1(7) ae 0,8,(x)y 1) = O,(x)z n#0 
=0 n=0 
n—1 
Therefore 0D, (7) = ApnX"Z — > nO, (x)Z 
‘Le. 0,Dy,(7) = Ann A ee is nr OD, (7) (81) 
where Aula) = fl xe 25 Dade (82) 


A similar but more complicated expression can be found for the 
Coefficients C,,,,(7) occurring in the expansion of the second 
probability density: 


Paltis X25 7) = BE Cam(V Oa DOC2)Cr)0CH) - (83) 
where Cun(T) = [[ @n(21)mCxadeate 1 X23 T)dx dx (84) 


For convenience we choose to express eqn. (80) in the form 


8,(X) = bax" + byyx?1 + ...4+5x7+ ...4+d9 . (85) 

Then CAT = z= s b,b,%,(7) « (86) 
r=0 s=0 ; 

where oar) = [| x5x$P earn x95 did, (87) 


Therefore all the coefficients C,,,,(7) and D,,(7) needed for the 
optimization can be expressed in terms of the joint moments 
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a,(7) and X,,(7) defined by eqns. (87) and (82) respectively. 
These joint moments can be found from the joint characteristic 
functions; in particular 


ce) ” 
Num(T) = xia Gobs ou? sil Uz;7) . (88) 
where ,,(uy, U2; T) = il eluxtmz)y (x, 2; T)dxdz (89) 


(a) When the input is equal to the sum of signal, z(t) and 
noise, v(t), 


Lam = X~x = (Wy + 2)" + 29)” 
n m 
=>» x "CMCvizt "525° 
r=0 s=0 


and when z(f) and v() are statistically independent, 


n 


Phones SD > REC Fixi zn ee th (90) 
r=0 s=0 
Ney = xz = (M% + 2)%Z 
n a 
=) "C,vizt "22 
r=0 
and when z(t) and v(f) are statistically independent, 
n i 
Ant = 27M Zt "Zo (91) 


(6) When the input is equal to the product of signal and noise 
the joint moments are easily found, since 


Sam (T) = 


ae N=INy N,N 
= 2427 V0 


mM —. of, Ny, yn 
xixh = zhviz5'V5 


(92) 
provided that z(f) and v(t) are statistically independent, and 


An (1) = X22 = Zz (93) 
provided that z(f) and 2(f) are statistically independent. 

It will be appreciated that the labour involved may be very 
great in order to compute first the polynomials and then the 
coefficients of an expansion for the second or joint probability 
density function from a knowledge of either the second or joint 
characteristic function or the probability density function itself. 
Also, when the expansion corresponds to a process which itself 
is the sum of two other processes whose probability densities 
are given [as in (a)] the work is greater still. One such problem, 
the solution to which is given earlier, took about 120 hours to 
calculate the expansion coefficients on a desk calculator. 


(8.3) Mean-Square Filtering Error 
The mean-square filtering error is given by eqn. (1): 
=DP@ — zOP =»0? + AH — 2O2H- () 
Let y(4) = eee where g represents any linear or non-linear 
operation, of finite mean square, on the whole, or part, of the 
past input. 
The variation in e” produced by a variation in g[x] of eh[x], 


where ¢ is a small constant and h[x] again depends only on the 
past of x, is given by 


Ae? = — 2eh@)[g(x) — z] + 2h)? 


72 LUBBOCK: THE OPTIMIZATION OF A CLASS OF NON-LINEAR FILTERS 


For g(x) to be optimum, 


dAe? us ett Sate 
— | == —z] =0 
| 7 ibs 2h(x)[ g(x) — z] 
Therefore h(@®)[g) — z] =0 (94) 
Thence the following two conditions are derived: 
For a chosen variation <h(x) = eg(x) 
g(x)? = zg(x) 
ive. y(t)? = 2(y(0) (95) 
For a chosen variation «h(x) = « x 1 
Be) =z 
i.e. WA = 2). (96) 


Eqns. (95) and (96) are seen to be necessary if g(x) represents 
an optimum filter belonging to any class. 
Substitution of eqns. (95) and (96) in eqn. (1) yields 


(97) 


That this result is true for each of the classes of filters previously 
considered is easily verified. 

Furthermore, suppose there should exist more than one 
optimum solution for g(x); by taking any two solutions g;(x) 
and g,(x) we may write, using eqn. (94), 


e* =a? = Ge. 


&1(x)[Bo(x) <i z] =0 


Similarly g(x)[g\(x) — z] = 0 


whence g\(x)z = go(x)z (98) 


From eqns. (95) and (98) it is seen that the mean-square 
error [eqn. (1)] is the same for each of the two solutions g;(x) 
and g,(x). Thus although a particular solution may not be 
unique we can be sure that it is no worse than any other solution 
in so far as it gives the same mean-square error. 


(8.3.1) The Class of Multi-Path Zero-Memory Filters of Degree R. 


Eqn. (94) gives the sufficient conditions for g(x) to represent 
an optimum filter of any specified class where h(x) represents 
any physically realizable filter of the same class. 

For the class of multi-path zero-memory filters of degree R, 


R 
g(x) = 2 Kemm(%) 
The most general form for h(x) is 
R 
hO) =D kanBaCe) 


Substitution in eqn. (94) gives 


R R 
3 al D KemFn(2)om() = OE | =e. (99) 
Since 000, =0 msn 
=1m=n 
R R 
we obtain D hiker = Ske 8, (2 . (100) 
n=0 n=0 


Since the set of constants k,,, can be given any pre-assigned 
of values only one solution exists, namely 


Ken SOAK Fe. 
ni=)0, 12 


(10 


Also, since 6)(x) = 1, 


ko = gx) = = 20s 


and the optimum multi-path zero-memory filter of degree R i 
given by 


(103 
R 
E[x@] = & oPnPlxO] + 200 - (104 


This equation is identical with the equation obtained by trun 
cating the expansion (21). From eqn. (104) 


a2 = ff ix] = zy =3 o2D2, 


since 6(x)0,(x) =0 mA~An 
=)1 m=n 
Substitution in eqn. (97) gives 
ot R 
po ese | aa = 
mie 031 a o3(1 > Dh) (105 


(8.3.2) The Class of Multi-Path Memory Filters of Degree R. 


R a 
2x) = ¥ | eogm(7)On>te — Daz 
m=0~°0 


The most general form for h(x) is 


R 2) 
h(x) = YJ coin Enlace — ]er 


Substitution in eqn. (94) gives 

R (oe) R co 

Sy I en) >> I Bem(T2) Caml (71 A 72) dr 
=0~0 0~0 


—8, [x(t — DEO bar, ie (106 


Using the fundamental lemma of variation calculus> and notin; 
that all w,,(7) are arbitrary but physically realizable weightin; 
functions of linear networks [i.e. w,,(7) =0 for 7 < O], w 
obtain 


R fee) 
yy; if Wm(T2) Caml (74 im) T) |dr> Te 6,[x(t - 7)z|() =0 (107 
m=0~0 
Kizn0, 2 See 
but since Co, = 0, m0; Com = 1, m = 0; and A[x())] = 1 


| eo(adr = w =F (27 
0 
R (oe) 
and Ef enltdConl —1))]dt,=90, Dyl(t) 120 
haa . « 
From egns. (26) and (27), it follows that 


R R i2) ie 9) 
G=T DO J, ealrdem(t I aml(rr — 72]dridry (108 


n=1 m=1 


R [s) 
or we may write 03 = & | D,,(r)o,a,(r)dr (109 
n=1~0 
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and by substitution in eqn. (97) obtain: 

a & 

P= (1 — 2) =o2— > [ Dyi(7)o,0,(7)dr . (110) 
OZ n=1~0 


\(8.3.3) Performance Index for Noise Filters. 


Even in the most unfavourable conditions when the statistical 
characteristics of the signal and noise are identical, an optimum 
linear or non-linear noise filter will reduce the mean-square 
output error given by the optimum attenuator; for example, 
the optimum linear filter for the condition when the input 
signal and noise have the same spectral densities becomes an 

attenuator. Therefore it is both logical and convenient to 
choose the mean-square error of the optimum attenuator as a 

datum against which to compare the mean-square errors of 
more complex optimum filters. 

The optimum attenuator mean-square error is given by 
eqn. (107): 


| 


e = o%(1 — D},) . (111) 
and the comparative mean-square error by 
a Bo a2 2 
ape ea ein ye ee (112) 
ey See ili 
When the signal and noise are additive and uncorrelated, 
a Beer 
. (113) 
e 1p oe) OX 


where e is the mean-square error of the optimum attenuator, 
é is the mean-square filter error and 0 < e/e2 < 1. 
The two limiting values of e2/e2 represent the extreme con- 
ditions when the optimum filter is perfect and when it is no 
better than the optimum attenuator. If e?/e2 <1 we say the 
improvement in mean-square error is ‘significant’. 


(8.4) Method of Undetermined Coefficients 
_ The solution for the set of integral equations [eqn. (26)] has 
been worked out by Wiener,* and a modification of his method 
‘is formally presented here. 
The integral equations are 


R o 
EL Smnlers = 72om(addra — Dinlrvo, = Hylrs) 26) 


where Hpi Ov t7-0-n —1,2....R 


Taking the bilateral Laplace transform!? of both sides of these 
equations, 


R 
2X Enm(PYY mi) — By(p) = #,(P) (114) 
Ge 
where #%(p) has singularities in the right-hand half of the 
p-plane only [since H,,(7;) = 0, 7; > 0]. 

For simplicity the method will be described for the case of 
two equations only. 


€ (Pp) €12(p) 


Let ip) = ew Cap) oa 
— |Iu~) +A) 12?) 

then D(p)V\(p) = D(p) + Hip) Sop) (116) 

and Hn) = |B BNP TO) | ary 


€r(p) Bap) + Aly) 


Therefore 
D(p)V (Pp) = [F122 — M6 12] + MO x. — HE 12 (118) 
Suppose we can write O(p) = O*(p)O~(p) (119) 


where ®+(p) has poles and zeros in the left-hand half of the 
p-plane only, and ®~(p) has poles and zeros in the right-hand 
half only. Then 


D6 o2 — DE 12 HE 9, — HE 1 
O-(p) O-(p) 
W (p)®*(p) has poles in the left-hand half of the p-plane only 


if W,(p) is the transfer function of a stable system, i.e. 
w,(7) = 0,7 <0. Also, it may usually be assumed that 


limY,(p)®t(p)=0 . . 
po 


W(p)®*(p) = . (120) 


(121) 


It may be necessary to impose some further restriction on 


W,p) so as to obtain a unique solution to eqns. (26). This 
restriction is of the form: 
W. 
lim And) =—CONStanlvmre) ete (122) 
p—> oo 


where for noise-filtering problems normally i = 0 and in pre- 

diction problems normally i= 1. Only approximate physical 

filters for i> 0 can be realized, but the approximation is not 

serious in practice if i is not too large. In any particular 

problem i is chosen so that a unique solution is obtained. 
Assuming 4%;(p)®*(p) is a rational algebraic function, its 

numerator must of a lower order than its denominator. 
Therefore 


W(py®*(p) = E 


16 22 a a 
+ 


ee = curl 
®-(p) + 


®-(p) 
(123) 


where the suffixes ‘+’ indicate that only those terms of the 
partial-fraction expansion representing poles in the left-hand 
half of the p-plane have been taken. 

Since #, and #, have no poles in the left-hand half-plane, 
poles in this half-plane of (#62. — #6, )/D~—(p) must be 
poles of both @, and @,>. We can write 


HC 7 — HE na b aval 
®(p) pate Apap Pee 


where a, b, c, . . . are undetermined coefficients, which may be 
determined by substitution for W,(p) and W,(p) in one of the 
original equations (114). 

A set of simultaneous equations in the undetermined coeffi- 
cients of both W, and W, is obtained by equating residues of 
left-hand half-plane poles of this equation and by using eqn. (122). 


e124) 


(8.5) A Second Probability Density of Class A 


We shall consider the signal developed at the output of a zero- 
memory non-linear device characterized by the equation 


Hs] 5? -1 
/2 /2 


where z(f) is the output, s(t) the input 


z(t) 


(125) 


d" 
pa 2/2 — 52/2 
and H,,(s) = (—1)"e! we ta! 
When the input is Gaussian with zero mean and unit variance, 
the output is a signal whose probability density function 
P2(Z1, 223 T) is of class A. 
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It is always possible to find a polynomial 6,, of order m 
such that 


¥ = 8,,[H2(s)] = Ham(s) (126) 


A proof of this statement is not given here but it can be 
verified for all finite values of m as follows: 
Let 0,, and 0, be two such polynomials, and write 


mn — fi 6(2Z1)9,(Z2)P2(Z 4; 293 7)dz,dz 


where $n is the cross-correlation between the outputs y, and 
¥» of the polynomial-function generators @,, and 0,,, (see Fig. 13). 


‘Fig. 13. Geeta 6, and 0, chosen so that y; = Ho,[s(t)] and 
= H»[s(t)] when s(t) is a Gaussian signal. 


Then 
a IJ Or nLH2(s1)]6,[Ho(s2) ]pols,, 523 T)ds;dsy 
= [Bos DHan(52)P2(s1, 523 T)ds,ds> 
whence bx = 0 n¥~m 
= psin(T) n=m (127) 


where p,,;, is the normalized auto-correlation coefficient of s(t 
But don = Cnn, using the same notation as used in eqns. (8 
and (87), and p,(z;, z; 7) is a member of class A. We 
note that p,,,,(T) = p2,.,(7). 

Therefore 


Cult) ="Pinzkt) 2. 


whence, if p,,..(7) = eI", then p,,,,(7) = e778lrl, The pro 


bility density of z can be obtained as follows: 


1 : : 
p(s) = regres for zero mean and unit variance. 
qT 


s=VV(2V/2 + 1) 
2sds = 2dz 
Therefore 
1 1 
gp — V24+1/2 Pets 

PO = x 758 VeVTET +1), 7 

The moments of z can be found from 
Z = f z"p(z)dz 

or from Z? = es ae So : (130 


given the moments of a Gaussian process with zero mean ant 
unit variance. 


Whence 7=0, 7 = a 
96-166 522, z° = 755-0, z7 
897 799-3379 and 710 — 


Z = 2828427, #=15, P= 
= 6983-386571, z§ = 74417, 2 = 
Doe 


(621,316.57: 621.316,923 
hie 


ry 
(80). 
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SUMMARY 

After a review of the aims and merits of protective devices for 
low-voltage installations, the joint use of h.r.c. fuses and miniature 
\circuit-breakers is shown to be functional in many installations. 
| The hypothesis that the 0-01 sec fusing current forms a criterion for 
' discrimination is subjected to experimental tests and the statistical 
nature of discrimination is analysed. Practical conclusions for 
/ installation design are drawn, and a new fuse series and various other 

' methods for backing-up miniature circuit-breakers are suggested. 


b 


| : (1) AIMS AND MEANS OF PROTECTION 


Two trends have developed in the protection of low-voltage 
\ installations in recent years: the h.r.c. fuse has found wide 
application in England, while the miniature circuit-breaker has 
_ become popular in America. Both these trends have reached 
“South Africa, where, at present, h.r.c. fuses and miniature 
circuit-breakers compete in replacing the semi-enclosed fuse in 
better-class installations. This has raised problems of practical 
layout design which led to the investigations presented in this 
and a previous paper.! 
_ In order to assess the merits of different devices, a brief 
glance at the requirements of protective systems is warranted; 
these are 
(a) Short-circuits must be cleared rapidly by protection devices 
of adequate breaking capacity, and overloads must be disconnected 
after a suitable time. 
| (6) When faults occur, interruption of supply should be localized 
by suitable discrimination, and reinstatement should be rapid and 
convenient. 

Since circuits are numerous in l.y. installations and less 
important than the main arteries of a big distribution network, 
Ly. protective devices should be relatively cheap; discrimination 

is achieved by fuses or circuit-breakers with simple delay 
mechanisms, usually of an inverse time characteristic type, 
 Namely* = 


(a) The high rupturing capacity fuse (h.r.c. fuse)—a cartridge fuse 
with a breaking capacity of at least 15kA (B.S. class AC 3, 4, or 5). 
- (b) The semi-enclosed fuse—a rewirable fuse with a breaking 
capacity below 4kA. 

(c) The miniature circuit-breaker (m.c.b.)—a small air circuit- 
breaker with a breaking capacity not exceeding 5kA. 
ASH The air circuit-breaker, with a breaking capacity of at least 
Fie The oil circuit-breaker, with a breaking capacity of at least 


All these devices can claim a place in different installations 
and, for some applications, several can compete at present. It 
is the aim of the paper to discuss rational principles of selection, 


* These definitions are convenient for the purposes of the paper, although the 
term “m.c.b.’ may be used elsewhere for moulded air circuit-breakers of 10kA breaking 
| capacity and although oil circuit-breakers of less than 10kA breaking capacity are 

‘ made, 
1. ‘L.V. installation’ here defines a 3-phase 400-volt or single-phase 230-volt installation. 
_ “Rated breaking capacities’ are given in r.m.s. amperes at these voltages, while ‘slow’ 
| and ‘fast’ h.r.c, fuses are defined in Section 10.1. 


Correspondence on Monographs is invited for consideration with a view to 
| publication. 
Dr. Einhorn is Senior Lecturer in Electrical Engineering, University of Cape Town. 
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to present recent tests on the discrimination of m.c.b.’s and 
h.r.c. fuses, and to establish a theoretical basis for interpreting 
the results and predicting discrimination. 


(2) SELECTION OF PROTECTIVE DEVICES 
(2.1) Merits of Various Protective Devices 


Circuit-breakers are fundamentally more convenient than 
fuses and allow easy and rapid reinstatement of circuits after a 
fault has been cleared. They are popular for this reason, 
particularly in residential and commercial premises. Protection 
against sustained small overloads by circuit-breakers which 
open at 1-25 times rated load compares favourably with the 
fusing factors commonly attainable. 

The breaking capacities of m.c.b.’s, however, are very much 
lower than those of h.r.c. fuses, with which they compete on a 
basis of first cost, while both oil and air circuit-breakers are 
relatively expensive. 

H.R.C. fuses have excellent breaking capacities and compare 
favourably with semi-enclosed fuses as regards precision and 
fusing factors (about 1-4-1 -6). 

Semi-enclosed fuses commend themselves by cheapness, and 
discriminate easily in front of m.c.b.’s, but they have a low 
breaking capacity and are liable to deteriorate, which necessitates 
a fusing factor of 2. They also lend themselves to easy abuse. 


(2.2) Selection for Breaking Capacity 


The most critical requirement of a sound installation is ade- 
quate breaking capacity. But breaking capacities greatly in 
excess of possible fault currents seem to offer little benefit, and 
convenience or cost become decisive factors where short-circuits 
are limited. A rational choice of equipment will therefore 
depend greatly on its position and on the duty it is expected to 
perform. A previous analysis of short-circuit currents in typical 
l.v. installations! can be summarized as follows: 


(a) Short-circuits at or near the distribution boards of a large 
installation usually exceed the breaking capacity of m.c.b.’s or 
semi-enclosed fuses, but are rare. 

(6) The majority of faults occur at the load end. They are 
appreciably limited by the resistance of the load-end cables from 
sub-distribution board to final load-outlets (LC in Fig. 1). 

(c) The highest short-circuit current due to a load-end fault 
depends very much on the length of this final load cable. Fig. 2 
illustrates the following: 

(i) Almost all single-phase lighting circuits wired with 3/-036in 
cables or less are within the capacity of a 5kA m.c.b., and wiring 
distances in the order of 10ft are sufficient to bring the fault 
current below 2:5kA. 

(ii) Most single-phase power circuits wired with 7/-036 cables 
can also be controlled by a 5kA m.c.b., but minimum distances 
in the order of 20-30ft are required for a 2:5kA m.c.b. (Earth 
faults in 3-phase circuits are not likely to be more severe than 
faults in single-phase circuits.) 

(iii) Phase-to-phase or 3-phase faults, however, can be much 
heavier, and the minimum distance even for a 5kA m.c.b. would 
be 20-30 ft. 

(d) The short-circuit phase angle of most load-end faults is 
small, since they are mainly limited by cable resistance; tests have 
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shown that the breaking capacity of a circuit-breaker is in this case 
relatively high.3 

(e) Behind a small transformer (of 200kVA or less), even a 
fault at the main distribution board will not exceed 5kA; the 
phase angle of a fault which is mainly limited by transformer 
impedance is, however, large. 


MO.B 


SDB: 


BACK-UP DISCRIMINATE 


(ii) ALL HRC DISCRIMINATE 


(© ©. EEE 


Fig. 1.—Circuit of typical l.v. installation. 


M.D.B. Main distribution board. 


S.D.B. = Sub-distribution board. 
M.P. = Main protection. 
S.P. = Sub-mains protection. 
L.P. = Load circuit protection. 


L.C. = Load circuit. 
Lower portion shows protection ranges 


(i) with m.c.b.’s on sub-distribution boards. 
(ii) with h.r.c. fuses on sub-distribution boards. 


20 


30 40 50 
CABLE LENGTH, FT 


Fig. 2.—Minimum cable length for limiting faults to 5kA and 2kA 
(see Table 3 in Reference 1). 
Cable sizes: 3 = 3/-036in; 7 = 7/-036in. 
Fault types: L = Loop; phase-neutral fault. 
S = Severe; 3-phase fault. 


(Earth faults depend on earth-return resistance, are always milder than S-type, and 
probably similar to L-type faults.) 


Typical installation with 0-03-ohm mains. 
— - — Installations with 0-005-ohm mains (severe conditions). 


(2.3) M.C.B. Behind H.R.C. Fuse 


One of the main conclusions of this analysis is that in many 
cases it would be functional and economical to use m.c.b.’s 
on the sub-distribution boards, where their breaking capacity 
is adequate and where convenient reinstatemént of the relatively 
frequent load-end faults is particularly desirable, and h.r.c. 
fuses on the main distribution board, where their high breaking 
capacity is fully exploited and where they compete with relatively 
expensive oil or air circuit-breakers. This layout leads to 
interesting problems of discrimination and back-up protection, 
which can become embarrassing if the choice of equipment is 
casual or if the radical difference between the functioning of 
fuses and circuit-breakers is not understood. 


(3) DISCRIMINATION AND BACK-UP PROTECTION 
(3.1) Requirements 


While discrimination over the widest possible current range 
is desirable with equipment of equal breaking capacity, e.g. 


FUSES AND MINIATURE CIRCUIT-BREAKERS 


h.r.c. fuses on all distribution boards,* the requirements of 
mixed installation are quite different. 

If the fault current is within the breaking capacity of th 
m.c.b.’s on the sub-distribution boards, discrimination is desire 
and the m.c.b. should clear the fault within the pre-arcing tin 
of the fuse on the main distribution board. However, if tt 
fault is too severe for the m.c.b., considerations of safety overric 
the desideratum of discrimination and the fuse in front must a 
as back-up protection and blow before the circuit-breaker 
damaged. These two requirements imply that the current/tin 
characteristics of fuse and circuit-breaker should cross. Fo 
tunately this is always the case, because of the different breakir 
mechanisms of the two devices. A circuit-breaker normal 
interrupts ‘current flow near a current zero, i.e. at the end of 
half-cycle. For a heavy short-circuit one half-cycle, i.e. 0:01 se 
can be taken as a nominal breaking time (see Fig. 3), althoug 
the actual time of current flow depends on the instant of fau 
initiation, which introduces a random effect. An h.r.c. fus 
on the other hand, is able to cut off a large current in a fractic 
of a half-cycle.4 This means that, in the event of a very heay 
fault, discrimination is impossible but backing-up occurs. 


(3.2) Discrimination Criterion 


The question arises: at what fault current does the transitic 
from discrimination to back-up protection take place? TI 
author has previously proposed as a working hypothesis!>* th: 
the discrimination limit is determined by the 0-Olsec fusir 
current. Fig. 3 shows the current/time characteristics of a 


CURRENT 


0:01 SEC 


TIME 
Fig. 3.—Current/time characteristics of slow fuse, S, fast fuse, ] 
and m.c.b., B. 


If the fuse is on the main distribution board and the m.c.b. on the sub-distributic 
board, P (resp. P’) is the take-over point, whereby discrimination occurs for fat 
currents below Jp (resp. Jj) and back-up protection takes place for faults above 
(resp. I(). 


m.c.b., a fast fuse and a slow fuse. The take-over point 
(respectively P’) on the intersection of fuse characteristic and tt 
vertical line for t = 0-01 sec indicates the discrimination limi 
one should expect discrimination if the fault current is below / 
while for larger currents the fuse acts as back-up protection 1 
the circuit-breaker. Take-over is in practice almost independe1 
of the m.c.b.’s normal current rating, but determined by tt 
choice of the fuse. 


(4) EXPERIMENTS 
(4.1) Scope and Method of Tests 


In order to verify the criterion postulated above, discrimin: 
tion tests were carried out on two series of h.r.c. fuses of differes 


, make in the range between 30 and 160amp (for previous tests 
see Reference 1). According to the manufacturers, one of these 
fuse types should be fast and the other should be relatively slow, 
although both are faster than semi-enclosed fuses (see Fig. 7). 
‘| Two makes of m.c.b.’s were used, each of 5, 15 and 30 amp 
| ial rating, their rated breaking capacity being 2:5kA. 
Currents were measured by means of a cathode-ray oscillograph 
and camera, which thus gave a record of both magnitude and 
lwaveshape. (For method of experiments see Section 10.2.) 
' All oscillograms (except initial tests made at low currents for 
| jcetibration purposes) consisted of one or two loops only. They 
"can be classified into six groups (Fig. 4) and are discussed in 
| Section 10.2. 
etl 


(a) (b) (c) (d) (e) (f) 


Fig. 4.—Oscillogram types. 


(a) Full loop (containing the current peak). 
(6) Full loop, with minor loop. 
(c) Stunted loop, with minor loop. 

if (d) Tail-end. 

i (ec) Cut-off (single loop, post-peak). 

(f) Cut-off, with minor loop. 


(4.2) Results 


| The discrimination results are presented in Figs. 5 and 6. 
_R.M.S. short-circuit currents, i.e. 0:71 of the observed peak, 
are plotted against fuse rating; discrimination is indicated by a 
_ cross, non-discrimination by a circle. In spite of occasional gaps 


4 


Ww 


fo 


RMS. FAULT CURRENT, kA 


30 -40...._60. slele) 150 
HR.C FUSE RATING, AMP 


Fig. 5.—Test results on slow fuses. 


Tests where the fuse blew are marked by circles; tests where discrimination was 
obtained by crosses. 

R.M.S. fault currents are defined as 0-71 peak values, scaled from oscillograms. 
Only some of the highest discrimination values and the lowest non-discrimination 
values are shown; numerous crosses below and several circles higher up were omitted; 
136 tests were made on 35 fuses. 


or random overlap this limit of the discrimination region can be 
easily ascertained and is shown by a line. The two lines are 
tedrawn in Fig. 7, which shows for further comparison the 
0-01 sec fusing currents of the two fuse types tested as taken 
from makers’ publications, and a previously established line 
for semi-enclosed fuses. 


(4.3) Discrimination Between Fuses and Circuit-Breakers 


The tests confirmed that discrimination depends on the fault 
current: it was always obtained at values considerably below 
the limit lines in Figs. 5 and 6, and never at currents far above 
it. It was also confirmed that the discrimination limit depends 
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R.M.S. FAULT CURRENT, kA 


30 40 60 100 160 
HR.C. FUSE RATING, AMP 


Fig. 6.—Test results on fast fuses. 


Symbols and comments as for Fig. 5. 170 tests were made on 59 fuses, 


DISCRIMINATION LIMIT AND RATED 0-01 SEC 
9° FUSING CURRENT, kA 


4 
201) 30m 40m m6 100 
FUSE RATING, AMP 


150 200 


Fig. 7.—Discrimination limit and 0-01 sec fusing current. 


—-—-- Limits, redrawn from Figs. 5 and 6. 

0-O0lsec fusing currents, taken from h.r.c. fuse characteristics published 
by makers. 

Semi-enclosed fuses, redrawn from Fig. 6 of Reference 1. 


on fuse rating, but not significantly on the m.c.b. rating [except 
that with 30amp fuses, Samp circuit-breakers discriminated 
better than 30amp ones in the reduced-voltage tests with traces 
of type C (Fig. 4) which is not important in practice]. In many 
practical cases, of course, the relatively high coil resistance of 
m.c.b.’s of small normal current rating can contribute noticeably 
to the circuit impedance, and hence can promote discrimination 
indirectly by limiting the fault current. 

The measured discrimination limits did not agree very closely 
with published 0-01 sec values, being above the published line 
for the faster fuses and below for the slower ones (Fig. 7). The 
opposite direction of the discrepancies speaks against a modi- 
fication of the 0:01sec rule. These discrepancies cannot easily 
be explained without a knowledge of the test methods by which 
different manufacturers obtain their fuse characteristics, and 
might well be due to different test power factors or different 
interpretations of statistical results. 

Although the difference between the two fuse types tested is 
less than expected, it is consistent and large enough to be 
significant: the discrimination limit of the slow fuses lies about 
50% above that of the fast fuses (see Fig. 7). It appears that 
published characteristics can give a useful lead to the choice of 
fuses, but do not permit accurate discrimination design. 
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(4.4) Observations on M.C.B.’s 


An analysis of minor loops in the test oscillograms suggests 
that the m.c.b.’s tested usually clear before the next current zero 
if a fault occurs at least 60° (i.e. 0-003 sec) before this current 
zero, although occasionally minor loops in the order of 0-004 sec 
were observed. 

For the higher current tests a 100amp air circuit-breaker of 
10kA rated breaking capacity was connected in series with the 
test m.c.b. and fuse. One can conclude from an observation of 
the flash emanating from the smaller test circuit-breaker that 
the larger one is quite able to provide back-up protection by 
sharing the breaking duty, although at the expense of dis- 
crimination. According to the published characteristics one 
could not expect the 100 amp circuit-breaker to discriminate 
against a smaller one for short-circuit currents above about 1kA. 

Two of the 30amp circuit-breakers finally failed to operate, 
owing to burnt contacts, after having cleared several short- 
circuits above, and numerous short-circuits just within, their 
rated breaking capacity. 


(5) STATISTICAL THEORY OF DISCRIMINATION 


The random effect mentioned in Sections 3 and 4 will now be 
scrutinized, and an attempt will be made to explain the statistical 
nature of discrimination tests. 

Curve H in Fig. 8 shows the heat produced in a conductor by 


ELECTRICAL DEGREES 
90 60 30 90 60 30 


aie 


1:2 


+0 10 
Sloss-| SRA eC a eee ean eee ements 
ian 
2 
o6 
15 


0-4 


(ERE A ~ SON 25H Lane | NT 
\ 
\ 
\ 
1 
1 


O-2 


m7 
w 
t 


Fig. 8.—Calculated curve for heat developed between time fo and 
current zero at time 27/o. 


Differences of H-curve show heat between any two instants. 

Phase is indicated by i-curve. Inverse-square-root-scale on‘right correlates Hy, the 
0-01 sec melting heat of fuse, with H-curve. Electrical degree scale on top refers to 
circuit-breaker-delay, Tg. 


a sinusoidal current suddenly starting at time fg and ending when 
current is zero at time 27/w. (See Section 10.3 for proofs.) 
Let us assume that a particular fuse will require heat H; to blow, 
and that an m.c.b. in series with that fuse has a delay time T,,, 
which means that all faults initiated between rt, and f, will be 
cleared at 27/w. Since there is an equal chance of the fault 
occurring at any moment in the cycle, the ratio w(t, — t)/7 
is the discrimination probability for this particular circuit- 
breaker, fuse and current (see Section 10.3). 

Fig. 9(a) shows the discrimination probability as a function 
of current, expressed in terms of the 0-01 sec fusing current, J,, 
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Fig. 9.—Discrimination probability as function of current with 
circuit-breaker delay as parameter. 


Curves were obtained from Fig. 8. Abscissa in Fig. 9(a): Prospective fault current 
in terms of Jf. : 
Abscissa in Fig. 9(6): Observed fault current in terms of Jf. 


of a certain fuse, with different circuit-breaker delay times a 
parameter. As an example of how to read the diagram, a short 
circuit current of 1-1 J; has a 35% probability of discriminatiot 
if the delay is 60°. Fig. 9(a) refers to prospective currents; i 
tail-end clearances are excluded, Fig. 9(b) is obtained, whicl 
shows the discrimination probability for currents as observe: 
on test oscillograms (see Section 10.3). 

Fig. 9(a) allows the conclusion that the 0:01 sec fusing 
current criterion is quite practicable, since the drop in the regio1 
of unity is fairly sharp; it is somewhat conservative for moderat 
circuit-breaker delay, since discrimination can occasionally occu 
at very much higher currents, although it is only fully certait 
at a slightly lower current. Fig. 9(b) explains why the result 
of discrimination tests are bound to be ‘statistical’, even whe: 
based on current peaks. The curves are so steep, however 
that a discrimination limit can be ascertained without a1 
extravagant expenditure of time and fuses. 

The minor-loop analysis in Section 4.4 indicates that the 60 
curve is applicable to our tests. 


(6) PRACTICAL CONCLUSIONS 
(6.1) Choice of Fuses for Use in Front of M.C.B.’s 


The use of h.r.c. fuses on a main distribution boar 
with m.c.b.’s on sub-distribution boards is functional, an 
should allow satisfactory operation if a suitable take-over poin 
between discrimination and back-up protection is chosen (se 
Fig. 3). The choice depends on the rated breaking capacity o 
the m.c.b. The author would favour a take-over range betwee: 
40 and 100%; this means that the fuse must take over belo 
the breaking-capacity current of the m.c.b., but should not d 
so below say 40% of this value. The top limit is a matter o 
safety; the lower limit prevents unnecessary loss of discrimina 
tion and is fortunately not critical. 

If this tolerance range is applied to the fuses tested, Fig. ' 
shows that fast fuses are undesirable in circuits below 80amp 
since they lose discrimination already in the region of 1kA 
In circuits, carrying more than 80amp, slow fuses would no 
provide back-up protection for a 2:5kA circuit-breaker. 5k/ 
circuit-breakers would combine well with slow fuses for circuit 
carrying between 60 and 150amp, or with fast fuses for circuit 
carrying 80-200 amp. 


(6.2) Suggestion of New Fuse Series 


It appears that neither series of h.r.c. fuses tested is ideal fo 
co-ordination with m.c.b.’s. This could not be expected, sine 
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‘existing fuse series have been devised for optimizing installations 
in which fuses are used exclusively.* 

1, What then would be the ideal characteristics of a fuse series 
for a main distribution board, if m.c.b.’s are to be used on sub- 
/ distribution boards? 

| Two requirements should be met, namely 


| (a) Correct overload protection of sub-main cables between 
| main- and sub-distribution boards; this determines the fuse rating. 

(b) A suitable take-over point from m.c.b.’s on the sub-distri- 

bution boards, determined by the breaking capacity of the m.c.b.’s. 


Fig. 10 shows two sets of possible characteristics, for m.c.b.’s 
ith breaking capacities of 2-5 and 5kA respectively, with take- 


Fig. 10.—Ideal characteristics for main distribution board fuse series 
for backing up m.c.b.’s on sub-distribution boards. 

Circuit-breakers of 5kA breaking capacity, 

—-~-- Circuit-breakers of 2:5kA capacity. 


JI = Prospective current, kA. 
t = Pre-arcing time, sec. 


F 


Over points placed at 75 % to allow safely for tolerance in charac- 
teristics. These curves present an ideal which would allow quite 
a wide tolerance in execution. Whether fuse manufacturers 
can provide a fuse series anywhere near the ideal is a technolo- 
gical and economic problem beyond the scope of the paper. 
The main requirement seems to be a slowing down of the smaller 
fuses. It might be pointed out that a sacrifice in breaking 
capacity thereby entailed would be quite acceptable in most 
practical applications, since the distribution-board fuse has 
Tarely to cope with currents much above 10kA and can in turn 
be backed up by mains protection of higher rating if necessary. 


(6.3) Back-Up by Air Circuit-Breakers 


Since relatively low-priced air circuit-breakers have become 
available in the 50-100 amp range and with breaking capacities 
in the region of 10-20kA, the exclusive use of circuit-breakers 
has been found economical. Back-up protection and dis- 
crimination are still a problem, however. Fast air circuit- 
breakers can lighten the breaking duties of the m.c.b. behind 
considerably for heavy faults above a certain current, at the 
expense of discrimination of course. It was not the task of the 
tests performed to establish this limit carefully. Published 
characteristics lead to the conclusion that discrimination is lost at 
current levels comparable with h.r.c. fuses. Although the 
inconvenience thus caused is mitigated by the easy reinstatement 
of circuit-breakers, manufacturers might consider the design 
of a series with higher take-over independent of normal current 
‘rating, on principles similar to those suggested in Section 6.2. 


(6.4) Oil Circuit-Breakers 


Most oil circuit-breakers are too slow to provide effective 
back-up protection to an m.c.b.! This means the circuit-breakers 
or fuses on the sub-distribution boards must have a breaking 
capacity to clear faults at their own terminals. There is no 
discrimination problem with oil circuit-breakers on the main 
distribution board, but they come into play very rarely, and 
their high capital cost seems to bear small dividends. 


(6.5) Mixed Distribution Boards 


A number of mixed layouts have been suggested before.! 
To summarize briefly: 

(a) Mixed sub-distribution boards: m.c.b.’s for long load circuits 
and h.r.c. fuses for short runs (see Figs. 1 and 2). 

(6) Mixed main distribution boards: a mixture of slow h.r.c. 
fuses for circuits up to 80amp and fast h.r.c. fuses for heavier 
circuits would give a design based on fuses at present available, 
which is comparable with, though not quite as good as, the fuse 
series proposed in Section 6.2. 

(c) Double protection by m.c.b.’s and back-up h.r.c. fuses, both 
on the main distribution board (with only m.c.b.’s on the sub- 
distribution boards): this promises excellent performance at a 
slightly higher price. 


(6.6) Protection Policies 


It is a matter of policy whether protection devices on sub- 
distribution boards are chosen to cope with faults at their own 
terminals, e.g. h.r.c. fuses [see Fig. 1(ii)], or whether reliance is 
placed on back-up protection on the main distribution board 
for this infrequent task, clearing only load-end faults at the 
sub-distribution board, e.g. an m.c.b. backed-up by a h.r.c. fuse 
[see Fig. 1(i)]. 

With correct take-over design the latter solution implies only 
a loss of discrimination, and not safety. The decision rests 
between the convenience of the m.c.b. but the loss of larger 
sections of the installation in cases of heavy faults, and better 
discrimination, but the nuisance and cost of relatively frequent 
fuse replacements for load-end faults. 


(7) SUMMING-UP 

Miniature circuit-breakers are convenient on sub-distribution 
boards and capable of clearing most load-end faults; busbar 
faults usually exceed their breaking capacity in large installations, 
and h.r.c. fuses or circuit-breakers of relatively high breaking 
capacity are then required on main distribution boards. 

The use of m.c.b.’s on sub-distribution boards with h.r.c. 
fuses on the main distribution board is logical and economical, 
but requires careful discrimination design, so that the fuse will 
cope with heavy short-circuits while the m.c.b. will clear over- 
loads or moderate faults and thereby achieve discrimination in 
most cases. 

The 0-Olsec fusing current can be considered as the dis- 
crimination limit, although a random variation due to the phase 
instant of fault initiation should often result in fortuitous 
discrimination at higher currents. 

Of the h.r.c. fuses tested, the fast type discriminates poorly 
for current ratings below 80 amp, while the slow type when rated 
above 80amp will not provide adequate back-up protection for 
2:5kA circuit-breakers. Characteristics of a series of h.r.c. 
fuses ideal for m.c.b. back-up protection are suggested in Fig. 10. 

Rapid-acting air circuit-breakers can provide back-up pro- 
tection for m.c.b.’s, but their characteristics must be carefully 
chosen lest discrimination be lost with relatively mild faults. 

Mixed distribution-board equipment may sometimes be 
warranted. Double main-distribution-board protection by 
fuses and circuit-breakers together offers excellent design 
possibilities. 
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The author is convinced that both h.r.c. fuses and m.c.b.’s 
have their place in low-voltage installations. The paper is 
intended to aid in the choice and thus contribute to rational 
design. 
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(10) APPENDICES 
(10.1) Fast and Slow H.R.C. Fuses 


‘Fast’ and ‘slow’ h.r.c. fuses are meant to be relatively fast 
or slow in the 0-Olsec region. It is quite possible that a slow 
fuse may have a smaller fusing factor than a fast one, which 
would imply crossing of the two fuse characteristics in Fig. 3. 
A quantitative definition of ‘fast’ and ‘slow’, if wanted, can be 
obtained from the relation: 


Io.o, = KTP? 


One could, for example, agree to call fuses ‘slow’ if k > 7, 
‘fast’ if k < 4, and ‘medium’ if 5 <k <6. With this definition, 
the two fuse types tested (see Figs. 5-7) would form border-line 
cases. For them, as probably for most conventional fuse series, 
k is substantially constant (between 4 and 4-5 for the faster, 
between 6 and 7 for the slower, types). 

Table1, based on the above definitions, might aid in installation 
design. 


Table 1 
RECOMMENDATION FOR CHOICE OF FUSE SPEED 


Fuse rating, amp 30 40 50 60 80 100 125 150 200 
For2"SkA mic:b. vs s s,m s,m;f sim, f mf f vf vf 


For 5kA m.c.b. .. vS vs s Ss Ss sims? ‘Shimeat mst 
s = Slow. 
m = Medium. 
f = Fast. 


vs and vf = Well within the slow or fast region. 


(10.2) Experimental Method 


A short-circuit was applied to the test h.r.c. fuse, the test 
m.c.b. and a 4-terminal resistor of 0:01 ohm (resp. 0:05 ohm 
for smaller currents), all connected in series (Fig. 11). The 
voltage across the 0-01-ohm resistor was applied to a cathode-ray 
oscillograph, for recording the current magnitude and waveshape 
photographically. 


HR.C. MCB. 


ACB. 
(a) 


Fig. 11.—Test circuits. 


(a) For currents exceeding 0:8kA, directly from substation busbars. 
(6) For'currents less than 1:2kA, via 230/18-volt transformer. 
OS = Operating switch. 
T = Time-sweep relay. 
C = Current-closing relay. 
Ry = Current-limiting resistor. 
R»w-= Four-terminal measuring resistor. 


Tests up to about 1:2kA were performed in the laboratory) 
over a 230/18-volt 2kVA transformer; the current was con 
trolled by two variable transformers in series from 380 volt9 
[Fig. 11(6)]. Tests between 800amp and 3:5kA were per 
formed in the 600kVA substation, directly from a spare oil 
circuit-breaker on the 230-volt main distribution busbars/} 
the current was adjusted by limiting resistors of copper wire 
[Ry in Fig. 11(@)]. 

The oscillograph time sweep through relay T, and the closing 
of the short-circuit current through the suitably delayed relay C} 
were both initiated by the hand-operated switch O.S. 

The current scale was adjusted by a decade box potentiometet) 
and precalibrated by means of ammeter and current transformer} 
at lower currents. Extrapolation of current calibration to very, 
much higher values was unavoidable, but the accuracy thus 
achieved was adequate for short-circuit tests where random 
effects enter and high accuracy is not needed. 

The currents recorded in Figs. 5 and 6 were based on the 
peaks observed. Fig. 4 shows that, in cases (a) and (4), but not 
in cases (c) and (d), they correspond to the short-circuit currents 
as determined by the network impedances which, in fuse- 
engineering parlance, are called ‘prospective’ short-circuit 
currents [although the term ‘retrospective’ would seem to suit the 
tail-end case (d) better]. In cases (a) and (e) the fault occurs 
at an instant of rising current, while in the other cases it occurs 
when the current falls, often resulting in a ‘minor loop’. The 
occurrence of (b), (c) or (d) depends on fault instance, on circuit- 
breaker speed and on current; in the infrequent case (c) the ar¢ 
resistance curtails the current peak. In case (d) discrimination 
was usually achieved. Cases (e) and (f) are typical for fuse 
operation. Cut-off after the peak, as shown in (e) was usually 
observed in our tests; cut-off before the peak, as shown in (/) 
and in fuse publications,* is typical for currents considerably 
above the discrimination limit. 


(10.3) Statistical Analysis 


In Fig. 8, if a sinusoidal fault current, i = \/2 sin wt, starts 
at fault instant fg and is terminated by a circuit-breaker at a 
subsequent current zero at time 27/w, the heat developed in a 
fuse element of resistance R is 


2n/la 


H= R| dt = 2R/? | sin? wtdt 
fo 


= RP[I/f — to + (1/2) sin 2wto] 


The heat, Hy, required to blow a particular fuse is a constant 


“accuracy. 
) probability with current. 


n case of a heavy fault, since the time is too short for any 
\ppreciable loss of heat. 
Discrimination will occur if H < H;,, and the fuse will blow 
fH > H;. The H-function shown in Fig. 8 is therefore closely 
related to the discrimination limit. 
| The probability of discrimination can now be reasoned out 
as follows: 
| There is an equal chance for the fault incidence (fy) at any 
(nstant in the cycle. Let us normalize the H-scale by taking the 
heat from a particular fault current in half a cycle as unity. 
If then H = H, for fault occurring at time ¢’, discrimination 
pecurs when fp > ?¢’, but the fuse will blow when fy < ¢’ (see 
Fig. 8). 

If the circuit-breaker delay time is 7, (which can be expressed 
in seconds or in electrical degrees) and t; = m/w — Ty, tp = 


\2n/w — T,, the discrimination region is between ?¢’ and f,, and 


the non-discrimination region between ¢, and ¢’. The dis- 


crimination probability is then 


lig 0 ots 8) 
‘| (t, = t1) T 


and can be obtained graphically from Fig. 8 with adequate 
Fig. 9(a) shows the variation of discrimination 
The abscissa is obtained by defining 
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fusing current (see right-hand scale in Fig. 8). The current / in 
Fig. 9(a) is a prospective fault current, and corresponds to the 
actual observed current in cases (a) and (b) of Fig. 4, but not in 
case (d). The discrimination probability of actually observed 
currents [ignoring the ‘stunted’ current case (c)] is given by 


1:57/w — ¢’ 
0-Sa/w + T, 


since tail-end short-circuits [case (d)] are counted in a lower 
current category. 

One can also deduce from the H-curve in Fig. 8 how much 
the instant of fault incidence would affect the current/time 
characteristics of fuses. It can be seen that this variation, 
although zero for the 0-01sec point itself, can be quite large 
in its vicinity and that different interpretation of statistical 
results can at least partly explain the discrepancies in Fig. 7 
between the test results and curves culled from published fuse 
characteristics. 

It should be noted that this statistical analysis, including 
Figs. 8 and 9, is based on the assumption of non-inductive 
circuits, which means that the short-circuit current can jump at 
instant f9 to a value i which is entirely limited by resistance. 
This assumption applies with good approximation to our tests 
and in practice to faults where the current is limited mainly by 
cables. 
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SUMMARY 


The basic computing element of an electronic analogue computer 
js the high-gain d.c. amplifier, but in order to overcome its inherent 
drift some form of auxiliary drift-correcting amplifier is frequently 
used. 

The paper describes a silicon diode modulator for use in such 
drift correctors, and shows that the zero stability which can be 
achieved is comparable with that given by the conventional relay 
modulator. It is also shown that the use of the diode circuit results 
in a number of improvements in the overall drift-corrector per- 
formance, such as a higher frequency response and practically unlimited 
life; the design of such a drift corrector for use with a given d.c. 
amplifier is described in detail and the resulting performance is 
assessed. 


LIST OF SYMBOLS 


a = Resistance ratio of the modulator balancing arm. 
b = Diode branch resistance ratio during conduction. 
c = Diode branch resistance ratio during non-con- 
duction. 
C, = Parallel combination of the capacitance across the 
diodes. 
V = Amplitude of the switching voltage. 
i = Diode current. 
i, = Grid current. 
I, = Diode leakage current. 
I, = Diode saturation current. 


K,G,(jw) = Frequency-response function of the d.c. amplifier. 
K,G,(jw) = Frequency-response function of the drift corrector. 
KG(jw) = Frequency-response function of the drift-corrected 


amplifier. 
k = A constant. 
p = Laplacian operator. 
Ro = Amplifier or drift corrector input impedance when 
purely resistive. 
R, = Parallel combination of the diode back-resistances. 
Ry = Parallel combination of Ro and R,. 
T = Absolute temperature. 
v = Circuit input voltage, or a voltage injected from a 
supply source. 
v’ = Amplifier input voltage. 
v” = Amplifier output voltage. 
v; = Modulator input voltage. 
Ue = Drift voltage. 
v, = Modulator output voltage. 
Vo¢ = Modulator output voltage due to unbalance in the 
conducting direction. 
Yon = Modulator output voltage due to unbalance in the 
non-conducting direction. 
= Modulator output voltage due to unbalance in the 
non-conducting direction in the presence of 
capacitance. 
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publication. 
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Zp = Feedback element of the computing component 
this may be purely resistive (Ry) or pure 
capacitive (C;). 

Z; = Parallel combination of all the input elements. 

= Amplifier or drift corrector input impedance. 

@, = Carrier angular frequency. 

ow, = Signal angular frequency. 


(1) INTRODUCTION 


(1.1) General 


The usual method of drift correction used in electron: 
analogue computers has been described by Goldberg.! 
modulator converts the slow variations of the input signal to 
alternating voltage; the latter is in the form of a modulat 
carrier which, after passing through an a.c. amplifier, is demod 
lated to obtain an amplified version of the original signal. Su 
amplifiers are often referred to as chopper amplifiers, the zer 
stability obtainable being many times better than that of th 
best d.c. amplifier. 

Drift-correcting amplifiers reduce the drift in a computin 
amplifier using the circuit shown in Fig. 1(a) either with or witt 


c4 


f 


(b) 
Fig. 1—Computing amplifier. 


(a) Basic method of connecting a drift-correcting amplifier: C eliminates the effe 
of grid current arising in the d.c. amplifier. 
(6) Drift-corrected computing amplifier complete with its computing components 


out an input capacitor.” If K; and K; are the low-frequency gait 
of the d.c. amplifier and the drift corrector respectively, tl 
low-frequency gain of the combined amplifier becomes K : 
(1 + K,)K,; the high-frequency response is the same as that « 
the d.c. amplifier alone. Also, the reduction of drift of the d. 
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mplifier is proportional to (1 + Ky), and if K, is sufficiently 
arge the drift of the drift corrector alone will be of importance. 
sn general, drift correctors reduce the input resistance of the 
| ombined amplifier and increase the noise level considerably. 
Pio addition, grid-current effects are not reduced by the drift 
corrector unless an input capacitor is used as shown in Fig. 1(a). 
in the following, a short description of the various amplifier 
mperfections is given, and the main effects of the drift corrector 
re briefly examined. 


Gh 2) Drift Corrector poses 


“ig. 1(), where Z; and Z, are the computing components, Zo is 
| ie ‘input impedance, K is the low- frequency gain of the com- 
yined amplifier, i, is the grid current and v, is the drift voltage 


‘eferred to the input. Then 
i v ee v’ 2 vy’ vy’ vy’ ; 
i z, eee zee 
and since v += —v'JK, 
zi ng Z, Vb, Ly 
bt = — v” rae EL At Vig iy (1 Camel) 6) 7 
4 K aR ae . aA gene 


() 


The second, third and fourth terms on the right are the errors 
due to finite gain, drift and grid current respectively. 

The error due to finite gain can be made as small as desired 

by making K sufficiently large, but it should be noted that K is 
the low-frequency gain only, and at higher frequencies the gain 
will be less, causing larger errors. To keep these errors small, 
a good high-frequency response is required. 
_ The error due to drift is v-(1 + Z;/Z; + Z;/Z), and is inde- 
pendent of K. In an integrator with Z; = R; = 1 megohm, 
Zo = Ro = 1 megohm and Z; capacitive with C; = 1 uF, the 
error due to drift is 


i 


If vy = 10mV, this output term will change at the rate of 
20mV/s and if a maximum output of 100 volts and an accuracy 
of 0-1°% is required, the integrating time must be limited to 
only Ssec. In practice, errors of this type are usually far larger 
than any others, and this is where the main improvement in 
accuracy can be obtained by the use of drift correctors. 

The output due to grid current is i,Z, and in an integrator 
the resulting error becomes i,dt/C;. “For iz = 107 9amp and 
C; = 1 pF, a rate of change of output of 1mV/s is present and 
this may itself limit the integration time for a given accuracy. 
Clearly, in order to obtain the full benefit of reduced drift, the 
grid current must be kept low. Electrometer valves are normally 
employed, giving grid currents of less than 10~!°amp, but a 
more effective method of reducing grid current effects is to use a 
capacitive input to the d.c. amplifier, Fig. 1(a). 

The amplifier input impedance, Zp), appears in the finite gain 
and drift error terms but always in parallel with the input 
impedance, Z;. The latter seldom exceeds 1 megohm, and so Zp 
should be at least as large as this to ensure that its contribution 
to these error terms is negligible. 

The noise voltage is increased by the use of drift correctors 
and may be as large as SmV referred to the input. Normally it 
has little effect upon adders or integrators, the latter tending to 
cause a reduction in noise level. In differentiators, however, the 
reverse is true, and a very large noise output may result. For 
this reason it is desirable to keep the combined amplifier noise 
level to a minimum. 
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The above discussion indicates that the desirable features of a 
drift corrector are low drift, high input impedance, low noise 
and good high-frequency response. 

The advantages of a good high-frequency response are: 

(a) The drift-corrector gain increases, not only the accuracy of 
integration, but also that of most other computing operations. 

(b) The size of the series capacitor used to remove grid current 
effects [Fig. 1(a)] can be reduced to reasonable proportions; the 
amplifier input time-constant can now be kept small, thus removing 
the main objection to this type of circuit. 

(c) In general, a higher carrier frequency will permit considerable 
reduction in the physical size of some of the components used. 
The best known type of drift corrector uses a mechanical 

vibrator operating at some frequency below 400c/s. Zero 
stability of 0-1mV or better can be obtained during its life, 
which, however, may be limited. 

In the paper, a silicon-diode modulator is described which 
is suitable for use in drift-correcting amplifiers for computing 
applications. Its performance is comparable with the mechani- 
cal modulator in most respects, but it has the advantage of 
higher switching frequency, better frequency response and 
practically unlimited life. This unit has been designed to work 
with the d.c. amplifier used in Utac [University of New South 
Wales (formerly University of Technology) Analogue Com- 
puter], but is suitable for much wider application with only 
slight modifications. 


(2) THE DIODE MODULATOR 


If two diodes are connected as shown in Fig. 2(a) and an 
alternating voltage is applied, the diodes will alternately conduct 


(b) 
Modulator. 


Fig. 2.— 
(a) Basic circuit. 
(b) Modulator working into a tuned circuit. 


84 GLUCHAROFF AND GILBERT: THE USE OF SILICON DIODES IN D.C. MODULATORS AND 


or block. The switching voltage is balanced to earth by two 
low resistances, R; and R,, so that, if the characteristics of the 
diodes are identical over the whole operating region, their 
junction point, N, wil) remain at zero potential at every instant. 
If a direct voltage, v;, is applied to the junction point via a 
resistor Ro, then, provided that Ry is much larger than the 
forward resistance and much smaller than the backward 
resistance of the diodes, the voltage v, at the junction point 
will be a square wave alternating between zero and v,. Thus, a 
direct voltage is converted into a pulsating voltage which can 
be amplified by a conventional a.c. amplifier. 

For an input voltage v,(7) the output voltage, v,(t), is given by 


0 1G BEINN Otis | 
OM nd] 3 | ip ae 

where w, is the angular frequency of the switching voltage and 
the second term in the brackets represents the square wave. 
With a sinusoidal input, v; = A sin w,t, for which w, is much 
smaller than w,, the first term in the brackets can be neglected 
since it is filtered out in the a.c. amplifier. Then by considering 
the fundamental term only, the output becomes 


phys &, : 
LO) = WA sin w,t sin wt 


which is the equation of a suppressed-carrier amplitude- 
modulated wave; this is fed into the a.c. amplifier. 

However, if stray capacitance and capacitance required for 
modulator balancing are considered, the total input capacitance 
of the a.c. amplifier may become as high as 50 pF, and this in 
series with Ro will give severe attenuation and phase shift at 
the switching frequency of interest. 

To eliminate this effect, an inductance, L, may be used to tune 
the circuit, as shown in Fig. 2(b), in which C, is just large 
enough to pass the carrier frequency. For satisfactory operation, 
the impedance of the tuned circuit at resonance should be several 
times larger than the input resistance Rp. In this case the 
suppressed-carrier amplitude-modulated wave is applied to a 
resonant circuit tuned to the carrier frequency. The effect of 
such a circuit upon the signal frequency*»* is the same as that 
of a single RC network whose break frequency is 1/277-R2C, 
where C is the total capacitance of the resonant circuit and R 
is the total equivalent resistance; above this frequency the signal 
voltage will be attenuated at 6dB per octave. In the present 
application, an additional 250 pF is connected at the amplifier 
input, making the break frequency about 300c/s. The use of 
this larger capacitance has the advantage of rendering the circuit 
less dependent upon inter-electrode and stray capacitances, 
which are liable to vary and affect the modulator balance. 

The tuned circuit also acts as a band-pass filter; it tends to 
reduce any components of the input signal which are not super- 
imposed on the.carrier, higher harmonics, switching transients 
and power-frequency pick-up. 

A switching frequency of 10kc/s was chosen, since this was 
already available in the computer Utac. However, it should be 
pointed out that a higher switching frequency would result in 
no improvement in this application since, as will be shown 
later, the gain of the drift corrector must become less than unity 
at about 200c/s and this frequency can be easily modulated on 
a 10kce/s carrier. 


(3) THE SILICON DIODE 


In the previous Section it was assumed that identical diodes 
were used, allowing perfect balance to be obtained at the junction 


point N. In practice, the characteristics of the diodes will v 
widely and several adjustments may be required to obtai 
satisfactory balance. 

The d.c. characteristic of a silicon diode for a few tenths 0 
volt in the forward and backward directions is well defined 
the relation © 


1/2 
i= Fett —1) +E (ever ume 1) 
3 i Kb 


where J, is the reverse saturation current and v is the volta 
across the diode. 

A typical semilogarithmic plot in the forward direction 
shown in Fig. 3. Examination of the diode equation shows t 
there are three parameters to be considered, J,, I, and k. 


~4 ae 
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Fig. 3.—Forward characteristic of a silicon diode. 


Indicating the effect of a temperature change. 
— — — Type of characteristic obtained for very low diode leakage currents. 


I, varies considerably from diode to diode but may be a 
small as 10~!4amp. Its effect is to give logarithmic behaviou 
of the diode up to 0-6 volt in the forward direction; above thi 
voltage large deviations from the logarithmic law can b 
expected. 

I, is of the order of 10~°-10~'!* amp and will determine th 
current in the forward direction up to the point where saturatio 
and leakage-current effects become comparable. In Fig. < 
this is shown as a change in slope of the logarithmic charac 
teristic to roughly half of the original value below 0-4 volt (fu 
line). In the reverse direction, the total current is determined b 
I,, the effect of J, being negligible. Diodes with low leakag 
current are characterized by excellent logarithmic characteristi 
in the range +0-1-+0-5 volt (dotted line) and very hig 
resistance in the non-conducting direction. 

The constant k determines the slope of the logarithmic cury 
and as a rule does not vary widely for a given make of diode. 

Eqn. (2) also shows that the diode current is a logarithm: 
function of temperature. This temperature effect appears t 
differ only slightly for all the diodes examined, and typical curve 
are given in Fig. 3 for two temperatures. 

The a.c. performance of semiconducting diodes is usual 
characterized by their recovery time, rectification efficiency an 
capacitance, but these properties showed no significant effe 
upon the performance of the modulator at a switching frequenc 
of 10ke/s. The capacitance of a silicon diode may be as lo 
as 1 pF. 


Ir | 

i (4) BALANCE IN PRACTICAL MODULATORS 

: (4.1) Low-Frequency Switching Voltage 

If a switching voltage of square waveform and low repetition 

d ate is applied to the modulator circuit shown in Fig. 4, in order 
0 obtain zero potential difference between the diode junction N 


| 


Fig. 4.—Circuit illustrating the sources of modulator drift. 


and earth during the conducting period, the required condition 
is that 


R, _ R; + resistance of diode 1 
R, 4 + resistance of diode 2’ 


where a = R,/R, and b = (R; + resistance of diode 1)/(R4 + 
‘resistance of diode 2). 

| For a sufficiently large switching voltage of amplitude V, the 
diode resistance can be smaller than R; and R, and hence b 


ora =b 


‘will depend mainly upon these two resistors. If a~b, an 
unbalance voltage, v,,, is developed: 
Bie 2-7 Ro R, Ry ) 
BP ORE Ry) RP Ry 
1 i 
vG +a@ 1+ 3) 
provided that Ro is much larger than 
Ry, Ro R3R4, 
grep ds “Ra 


which is normally the case. 
During the non-conducting period the condition for zero 
output at N is that 


R; _ Resistance of diode | Apes 
R, Resistance of diode 2’ meta 


where c = (Resistance of diode 1)/(Resistance of diode 2), since 
the diode resistances are much larger than R, and R,. 
If a ~ c, an unbalance voltage, v,,,, is developed such that 


Ro 1 1 
Yon VR + aA +a T+ 2) 


where R, is the parallel combination of the diode resistances 
in the non-conducting period. The unbalance in this case is 
largely dependent upon the input resistance Ro. 

If the ratio a is made variable, the conditions of interest which 
may arise are: 
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(a) Voc = Von, i.e. the voltages at N during both conducting and 
non-conducting periods are the same. For this type of unbalance 
there is no alternating voltage present at N but only a direct voltage 
of magnitude voc = Von Which causes a current to flow in Ro and 
the signal-voltage source. If, as in computing amplifiers, high 
impedances are involved, this may be very undesirable. 

(b) Voc = — Von. In this case a pure alternating voltage appears 
at N. The main process of balancing the modulator is to reduce 
this voltage to a minimum; if present, it will give rise to a spurious 
output and, as will be explained later, may cause drift due to gain 
changes of the following a.c. amplifier. 

(c) Vgc # Von. This condition is a combination of cases (a) and 
(b) and so both the defects mentioned above will be in evidence to 
some extent. 


The only case in which the potential difference between the 
diode junction N and earth will remain zero at all times is when 
Voc = Yon = 0 and this condition can only be met ifa = b= c, 

Clearly, a control must be introduced to vary either b or c, 
in addition to that required to vary a, in order to meet this 
requirement. Complete balance cannot be achieved in practice, 
and condition (c) normally prevails. 


(4.2) High-Frequency Switching Voltage 


Consider now a switching voltage of a higher frequency, of 
either square or sinusoidal waveform. Capacitive and transit 
effects, i.e. the behaviour of the diodes in the intermediate con- 
dition between fully conducting and fully non-conducting, must 
be considered. 

During conduction the condition remains as before (a = b). 

During the non-conducting period the capacitive reactance 
may be many times smaller than the resistance of the diodes 
and so the condition for balance is 


__ Impedance of diode | 
Impedance of diode 2 


Thus, both the phase and magnitude of the diode branch 
impedance should be controlled. 

To ensure complete balance during the transit period the 
incremental resistances of the diodes must be equal at every 
point of their characteristics.? From eqn. (2), with J, neglected, 


: me di ekT. = jj 
i=TJ,gKT for v slightly positive, and = if = or 
Pe day dv kT kT 
atta ne 9 i.e. for every current i and temperature T the incre- 

Ll t 


mental forward resistance is a function of k only. Hence, the 
requirement for balance is that the diode constants k should be 
equal, which means equal slope of the logarithmic curves. Also, 
if the two diodes follow the above relation, the incremental 
resistances will be equal for all values of temperature and so 
no unbalance can take place due to temperature variations. 

In the presence of a large leakage current, /;, the expression 
for the incremental resistance is not a simple function of k, T 
and i only, but is a complicated non-linear function which also 
includes J, and J;. To find two such diodes with equal incre- 
mental resistances over the whole operating region is almost 
impossible, and in practice it is difficult to achieve satisfactory 
balancing unless both diodes have low leakage currents, i.e. 
unless they follow a logarithmic law over the whole operating 
region (Fig. 3, dotted line). 

The final modulator circuit is shown in Fig. 5. 

The potentiometer P, is used to obtain a suitable ratio, a, 
and C, and P, are used to equalize the phases and magnitudes 
of the diode branch impedances during the non-conducting 
period. With these three controls a balance can quickly be 
obtained by trial and error. It is important to realize that, not 
only a.c. balance is required, but also that no direct voltage 
should exist at the junction point N. This can be checked by 
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Fig. 5.—Complete modulator circuit. 
Ro 680kQ. C4 25pF. 
R; R2 = 1002. L 0-8H. 
R3 Rg = 100kQ. Py 5Q. 
C; 0-001 uF. P, 10kQ. 
C2 250pF. D;, D2 Silicon diodes type 
C3 20pF. HD 6007. 


open-circuiting the input terminal of the modulator temporarily; 
with perfect balance no output change will be caused. 

Variations in the forward resistance of the diodes will be 
reduced by R3 and Ry, and the lower the diode resistance the 
greater this reduction will be; a low forward resistance implies 
a high value of k in eqn. (2). In spite of this reduction, a simple 
matching of the diodes may be advisable. The value of R; 
and R, should be such as to limit the operation to the logarithmic 
region of the diode characteristics and yet be several times 
smaller than Rp. The variable capacitor will balance capacitive 
differences and, as shown in the next Section, will reduce the 
effect of variations of diode resistance in the non-conducting 
direction. 


(5) MODULATOR ZERO STABILITY 


Since the modulator is essentially a bridge circuit, a change 
in any of the branches will disturb the balance. If only wire- 
wound resistors (temperature coefficient 0:002% per deg C) 
are used and all capacitors are silvered-mica (0:002% per deg C) 
or air insulated, the drift due to changes in the diode charac- 
teristics is far larger than that due to changes in the other com- 
ponents. Also, in bridge circuits the output due to unbalance 
is proportional to the applied voltage, and so the switching 
voltage must be as small as possible, and yet sufficiently large 
to cause conduction in the forward direction. 

During the conducting period, drift due to changes in diode 
resistance is considerably reduced by the large fixed resistors 
in series with the diodes. The only significant cause of variations 
of diode characteristics is temperature change and measure- 
ments have shown that the drift is less than 0:02mV/degC. So 
far, no ageing effects have been observed. 

During the non-conducting period, capacitive as well as 
resistive components are of importance. Since the balancing 
capacitor is several times larger than the capacitance of the 
diode, the total capacitance is effectively constant and only 
changes in resistance need be considered. 

Consider the modulator unbalanced in the non-conducting 
direction to such an extent that an unbalance voltage, v,,,, would 
appear at N in the absence of all capacitive effects. Then witha 
suddenly applied blocking voltage, V, it was shown in Section 4.1 
that 


7. Ro 1 1 
Yon am + at ta i+ = 


Owing to the shunt capacitance, the voltage v, which actua 
appears at N is a function of time given by (Section 14) 


VU, = Vo,(1 — e— ICR?) 


where C, is the parallel combination of the capacitance acro i 
the two diodes and Rr = R,Ro/(R, + Ro)- 
The initial rise of voltage per second is then 


Pte ile 1 1 
C,Rr ori +a a 
which is independent of the input resistance, Ro. } 
Thus, for a sufficiently high switching frequency, the unbaland) ; 
in the non-conducting direction is mainly determined by thi 
time-constant C,R,, and can be improved only by increasin} 
this term. C, is part of the tuned circuit and cannot be increase}, , 
indefinitely; also, since it is effectively in series with R3 and R, 
a large balancing capacitor. will greatly increase the losses ¢| 
the circuit. R,, however, is a property of the particular diode) 
used and should be as high as possible. It is interesting to com) — 
pute the unbalance for the modulator in question at the end of ) 
blocking period for Ry = 1 megohm and Ro = = 0, With 
switching frequency of 10kc/s, t = S50microsec; also, i) 
R, = 10° ohms and C, = 30pF, 


"=V(Ga- 


1 
, ag ae 
v, v(— 2h .)t 66 x 10~% volt for Rp = ©. 


-3 tg 
ce -)o 812 X10~° volt for Rp = 1 i J 


It is clear that at 10kc/s the unbalance due to changes in diod¢) 
resistance in the non-conducting direction is reduced by 4 
factor of about 1000 and is almost independent of the inpuj 
resistor, Ro. 

The effects of drift arising in the transit period are not reduced 
by the modulator circuit as are those which occur during the 
period of full conduction and complete non-conduction. Experi-| 
mental results tend to show that this effect can be made small] 
only by using diodes whose characteristics are essentially) 
logarithmic, as mentioned in Section 4. ) 

In conclusion, it may be said that the modulator output due! 
to unbalance is mainly due to changes in diode characteristics) 
with temperature, but fortunately such changes are completely) 
reversible and give rise to no long-term drift. 


(6) THE SWITCHING-VOLTAGE SUPPLY 


The waveform of the switching voltage is not critical 4 
both square-wave and sine-wave voltages have been tried. There! 
was no significant difference in modulator performance and so} 
for practical reasons, such as transformer design, the sine wave’ 
was chosen for normal use. 

Since the modulator employs a resonant circuit, the frequency) 
of the switching voltage must be kept reasonably constant. | 
Also, any changes in its magnitude will disturb the balance (a) 
10% change may cause a drift of up to 0-5mV), and if a large’ 
number of drift correctors are to be supplied from the same} 
generator, a low source impedance is required. In Utac, for 
which this drift corrector was designed, a 10kc/s crystal oscil- 
lator is available for timing purposes and this is used to supply 
a simple a.c. regulator as shown in Fig. 6. A portion of the 
a.c. output is rectified and compared in the feedback path with 
a constant voltage derived from a reference tube; any difference- 
voltage is used to control the gain of a pentode and thus maintain | 
the output voltage constant in spite of input and load variations 
(the regulation at full load is less than 2%). The tuned circuit. 
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10 ke/s 
CRYSTAL 
OSCILLATOR 
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Which forms the pentode load ensures a sinusoidal output, free 
from distortion. The switching power required for each drift 
corrector is about 10mW and so a regulator of 1 watt output 
can supply up to a hundred drift correctors. 

| Although the use of an a.c. regulator is justified where a 
mumber of drift correctors have to be supplied, it should be 


» Stressed that the requirements are not extremely critical and a 


mormal standard-frequency oscillator can be used quite 
successfully. 


(7) THE COMPLETE DRIFT CORRECTOR 


The modulator described in Section 4 has been used to build 
drift correctors specifically designed to work in conjunction 
with the high-gain d.c. amplifier used in Utac. They consist of 
‘an input filter, modulator, a.c. amplifier, demodulator and out- 
‘put filter, and the individual sections, shown in Fig. 7, are 
| described below. 


‘ 


AC. AMPLIFIER 

MONITOR 

DEMODULATOR 

OUTPUT 

FILTER 
CATHODE 

FOLLOWER 


Fig. 7.—Circuit diagram of complete drift-corrector, 


The input filter is used to shape the frequency response of the 


| complete drift corrector, and to reduce the carrier-frequency leak 
') back to the input of the amplifier. 


Its detailed design depends 


') upon the precise application of the drift corrector. 


by a triode which feeds directly into the demodulator. 


The a.c. amplifier is RC coupled, using a pentode as the input 
stage in order to avoid Miller capacitance effects; this is followed 
The 
total a.c. gain is 1600. The only design consideration is that 


the amplifier should introduce no phase shift at the switching 
| frequency, and for reasons given below, the amplifier gain 
should remain constant in operation within about 10%. 


The demodulator employs two germanium diodes in the same 
type of bridge circuit as is used in the modulator. No adjust- 
Ments are required here and normal 10% carbon-composition 
tesistors are sutistactury. The switching voltage is provided 
from the same transformer which supplies the modulator. 
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The output filter is a simple RC integrating network having a 
break frequency of 4c/s. 

In Fig. 7 it can be seen that the output filter is directly con- 
nected to a cathode follower; this is employed owing to the 
particular requirements of the d.c. amplifier used, and the 
output is taken via a potentiometer which controls the output 
level of the drift-corrected amplifier. The overall d.c. drift- 
corrector gain is 400, and since the drift of the d.c. amplifier 
used does not exceed 40 mV, this now corresponds to a maximum 
drift of only 0-1mV. This is of the same order as the long- 
term zero stability of the modulator, and so there is no reason 
to use a higher drift-corrector gain. 

Before demodulation, a small portion of the a.c. signal is 
rectified and the voltage obtained is a measure of the a.c. 
amplifier output; this can be displayed on an output meter for 
monitoring purposes. Since the modulator balance can never 
be perfect, there is always some residual unbalance voltage at 
the amplifier input: thus, even when the modulator input is 
zero, a meter reading will be obtained which indicates this 
residual voltage. There is an equivalent output from the phase- 
sensitive demodulator, and this can be balanced out only by 
using the output potentiometer. However, should the output 
of the a.c. amplifier change under these circumstances, a form 
of drift arises; such a change can be caused by drift in the 
modulator or a change in the gain of the a.c. amplifier. In 
practice, the residual voltage may be equivalent to up to 1mV 
d.c. input, and so a 10% change in amplifier gain can cause a 
drift of 0-1mV referred to the input. Normally, a change in 
modulator balance will cause an increased meter reading, and a 
reduction in amplifier gain, a reduction in the meter reading; 
also an overload or equipment failure is immediately apparent. 

Once the drift corrector has been set up, the only control 
which requires adjustment is the output potentiometer; such an 
adjustment becomes necessary only when a different d.c. ampli- 
fier is used or when any of the drift-corrector components have 
been replaced. 


(8) FREQUENCY RESPONSE 
As was pointed out in Section 1, the drift corrector is con- 
nected so that at low frequencies it is in series with the d.c. 
amplifier and at higher frequencies the d.c. amplifier alone is 
used [Fig. 1(a)]. The overall gain as a function of frequency is 
given by 
KG(jw) = [1 + K,G,(jw)]K,G, (ja) 


A logarithmic plot of K,G,(jw), the frequency-response 
function of the d.c. amplifier, is given in Fig. 8. 

So that the combined amplifier is stable in the presence of 
feedback, the overall phase lag must be less than 180° at all 
frequencies at which the combined gain is greater than unity. 
For this to be so for the given d.c. amplifier, the phase shift 
due to the term [1 + KG (jw)] should be zero for all frequencies 
above the lowest break frequency of the d.c. amplifier, which 
is 400c/s. This will be the case if the gain of the drift corrector 
is small compared with unity for all frequencies higher than 
400 c/s, and so the design criteria for the frequency response of 
the drift corrector are: 


oe Gain to be less than unity for all frequencies greater than 
00 c/s. 
(b) Phase shift to be less than 180° at all lower frequencies. 


The actual frequency response is shown in Fig. 9. The 
attenuation of 9dB per octave is obtained by suitably choosing 
the input and output filters, and the attenuation of 12dB per 
octave above 300c/s is due to the output filter and the resonant 
effect of the tuned circuit. It is seen from Fig. 9 that the drift- 
corrector gain becomes unity at 200c/s, and with the d.c. 


88 GLUCHAROFF AND GILBERT: THE USE OF SILICON DIODES IN D.C. MODULATORS AND 


120 a | | 5 es 


100 ~ al 


© 60} 
v 
2 
ia eee ol eee OL RO 
© 40 i < 
Se | = Ea oop 
wn 
= 
20 remain fe, cle N- = -80°a 
Ww 
0) 
2 3 4 5) 6 7 
10 10 10 10 10 10 10 


FREQUENCY, c/s 


8.—Gain and phase response of the d.c. amplifier for which 
the drift corrector was designed. 


Fig. 


60 


40 


GAIN, dB 
PHASE LAG 


-20 
01 1 10 10 10 
FREQUENCY, c/s 


Fig. 9.—Gain and phase response of the drift corrector, 
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Fig. 10.—Gain and phase response of the complete drift-corrected 
amplifier. 
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amplifier this gives the combined frequency response of Fig. 10, 
which is satisfactory. 


(9) DRIFT CORRECTOR IMPERFECTIONS 


This Section deals with the effects upon computation of the 
particular drift corrector described in Section 7. 


(9.1) Input Impedance 

With the drift-corrected amplifier used in a circuit as show, 
in Fig. 11, for any input voltage, v, an output voltage, v”’, 
be established; its size will depend upon the computing con} 
ponents Z; and Zy. Also, an error voltage, v’, will be set up 4 


Fig. 11.—Drift-corrected amplifier in a normal computing circuit. | 
the input of the amplifier. For low frequencies, the paralle| 
branch of the input filter is ineffective, and the error volta 
will work into the impedance offered by the modulator plus th¢ 
series resistor of the input filter only (Figs. 5 and 7). During 
the non-conducting period the modulator impedance is effec} 
tively the impedance of the tuned circuit at resonance, which) 
is several times larger than the input resistance, Ro. During the 
conducting period it is Rp plus the series resistance of the filter, 

Thus, a mean current, given very nearly by 


i’ = 4v’/(Ry + 270000) amp 


will flow and so the input resistance may be taken to 
2(Ro + 270000), which is approximately 2 megohms. 

For all higher frequencies, the parallel branch of the input 
filter must be considered, and at frequencies well above 10c/s 
the input impedance will again become resistive, the approximate 
value being (270 + 47)103 = 0-317 megohms. 


(9.2) Modulator Drift 


Consider the drift-corrected amplifier shown in Fig. 11, with 
the input lead connected to earth; in the absence of drift, 
v=v’=0. Ifa drift voltage appears at the junction N of 
the modulator (Fig. 5), it will give rise to an output voltage, v”, 
which in turn will set up an error voltage, v’, tending to reduce 
the effect of the drift. The input impedance offered to v’ 
depends to some extent upon whether the modulator drift is 
due to unbalance in the conducting or the non-conducting 
period (see Section 4). In practice, however, this has no 
significant effect and modulator drift can be considered in 
exactly the same way as that arising at the input of a normal 
d.c. amplifier. 


(10) PERFORMANCE 


During the development of the drift corrector a number of 
commercial types of silicon diode were investigated in an 
attempt to find the most suitable one. These tests were certainly 
not exhaustive, but it was found that certain types contained a 
reasonable proportion of diodes which had characteristics of 
the type indicated in Fig. 3 (dotted line). Thus, from a total 
of 30 silicon diodes of a suitable make, about half were suitable 
for use in the modulator. The d.c. forward characteristics were 
measured and the diodes matched in pairs. 

A total of ten drift correctors has been built and the per- 
formance observed over a long period when used in conjunction 
with the Utac d.c. amplifier under normal operating conditions. 

After operation for over four months, none of the drift 
correctors showed a drift, referred to the input, larger than 


‘ter the computer had reached a steady operating temperature. 
jhe maximum value of drift at first switching on was I mV, 
/it this figure could be reduced to less than a half if the heater 
jbwer was left on when the rest of the computer was switched 


| It is thought that better performance could be obtained if 
me form of temperature control were to be employed. The 
verage temperature coefficient of the drift corrector on its own 
| 0:03 mV/deg C, the temperature being measured within the 
jodulator compartment. 

The low-frequency gain of the drift-corrected amplifier is 
D x 10° up to 4c/s; for higher frequencies the gain falls at 9 dB 
jer octave. The effective input resistance is 2 megohms and the 
‘tal noise level, referred to the input, is 1mV peak-to-peak, 
iainly at 50c/s. The switching power required is 10mW per 
_rift corrector. 


‘| (11) CONCLUSIONS 


The above results show that silicon diodes can be used 
uccessfully in drift correctors for computing amplifiers. Tem- 
/erature is the only significant cause of drift, the effect being 
ompletely reversible. The circuit developed tends to reduce 
ll drift except that arising during the transit period from full 
onduction to non-conduction; this can be kept small only by 
ising diodes with logarithmic characteristics. 
| In addition, the silicon-diode modulator has a number of 
\ither desirable features: these include good high-frequency 
esponse, low noise level, low switching power requirement 
nd an operational life limited only by the electronic valves. 
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14) APPENDIX: UNBALANCE DUE TO CHANGE OF DIODE 
BACK-RESISTANCE 
In Section 4 it was shown that the unbalance due to a change 
of diode resistance in the non-conducting direction in the 
ibsence of all capacitive effects is 


1 
han Bo te a) 
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The circuit capacitance, however, will considerably affect this 
unbalance. The time-constant of the series resistors R3;and Ry 
(Fig. 4) and the balancing capacitors is only 2 x 10~°sec, and 
in the following analysis can be neglected. Thus, the modulator 
circuit can be simplified as shown in Fig. 12(a@), where R’, R” 


= 


Fig. 12.—Modulator circuits. 


(a) Simplified circuit applicable only when the diodes are non-conducting. 
(b) Equivalent circuit for a step input of switching voltage in the non-conducting 
direction. 


and C’, C” are the respective resistances and capacitances of 
the two diodes. The conditions for balarice can be stated as 


R, R Gu. 
a= — = 


RZ (OF 


In the following it is assumed that a = R,/R, = C”/C’ but 
R’[|R” =c#a, and the output, v, developed across Ro is 
derived for a suddenly applied blocking voltage of magnitude V. 

The open-circuit value of v, is given in Laplace transform 
notation by 


1 


1 
Rate (Cl 
R R” T?P 
= VO) eR 
| 

1 se 1 if 
R’ =p G R =r faxe. 

we R'(L + pCR’) 
SLR coho (ROG peek) 


where R, = R’R’/(R’ + R”) and C, = C’ + C” 
For a step input V(p) = V/p 


Rd ais pCR’) | 
(R’ + R“)(1 + pC,R,) 


=) |= 1 
Ed ae aa 


Cau 


VPrer 
v)) =S — 
oP) Dp is ae R; 


or —-,(p) = V(; _ , (3) 


The circuit can now be represented as in Fig. 12(b). Assuming 
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that Ry is much larger than R,R,/(R; + R,) the voltage », The value of v,,(p) given in eqn. (3) can now be substitut 
developed across Rp is given by in this expression, and allows a value of v, to be obtained as 
function of time. Thus: 


R 
Op) =p) pane ee 
t to R 4 1 Ry : : Bi 
0 eee Cink Gaon > iat & PR) 
R, P ‘P 
—10p (0D) wie Rol st pep Ry eee or V(t) = Vg_(1 — Ee #!CrR 7) 


RoR, 
(Ro + R(t We OR, + R,/ — where Rr = RoR,|(Ro + Rp) 
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SUMMARY 


The properties of the solid conductivity state below 600°K are 
lonsidered in the present Part. For comparative purposes the con- 
luctivity at 420°K is given the title of ‘low-temperature reference 
lonductivity 0429’, and this function is studied in relation to its 
mvironment. It is found that 4209 increases in almost linear fashion 
vith increase of conditioning temperature and that it also increases 
vith increase of chemical activity of the core metal. At constant 
emperature 0429 is invariant over a wide range of applied voltage 
nd current, and shows a high order of stability with time. Perhaps 
he most surprising result to emerge is the observed vulnerability of 
he solid conductivity to low-pressure oxygen attack, even at a tem- 
erature as low as 300°K. 


LIST OF PRINCIPAL SYMBOLS 


0429 = Low-temperature reference conductivity (measured at 
420° K), micromhos. 
0399 = Conductivity measured at 300° K, micromhos. 

1929 = Conductivity of the matrix at normal operating tem- 
perature (1 020° K), essentially a vacuum-phase con- 
ductivity, millimhos. 

Tin; = Temperature of intercept of conductivity phases, deg K. 
Ton = Conditioning temperature, deg K. 
Ty709 = Low-temperature total emission (measured at 700° K), 


Ry = Resistance of the matrix, ohms. 
Ro = Characteristic value of matrix resistance, ohms. 


(1) INTRODUCTION 


It is proposed in the present Part to examine some of the 
roperties of the solid semiconduction phase of the oxide matrix. 
‘ig. 1 shows a typical characteristic of the variation of log-con- 
luctivity with reciprocal of absolute temperature for a standard 
}-type* assembly, and the solid conductivity phase can be broadly 
egarded as covering all temperatures below (7;,,— 100)°K. 
t was shown in Part 4 that below (7,,, — 100)°K effectively 
he whole of the matrix current is carried by a solid conduc- 
ion process, that no act of thermionic emission is involved 
nd that heat is uniformly dissipated throughout the matrix. 
n Part 6 it was further shown that the solid conduction 
hase has a negligible magnetoresistance, in marked contra- 

* Specification of standard S-type assembly. 


Cores: Pure platinum, 4% tungsten-nickel or active nickel. 
Matrix: Co-precipitated equimolar barium-strontium oxide. 
Matrix density: About 1-0. 

Matrix thickness: 150u.. 

Matrix area; 0-25 cm2. 


pipe apeebtcy are vacuum-processed to the standard schedule detailed in Table 1 


This paper is a continuation of Monographs Nos. 221 R and 243 R, published in 
ebruary and June, 1957 (see 104 C, pp. 316 and 496), Nos. 268 R, 269 R and 289 R, 
ublished in December, 1957, and February, 1958 (see 105 C, pp. 183, 189 and 374) 
nd No. 317 R, published in November, 1958 (see 106 C, p. 55). 

Corres on Monographs is invited for consideration with a view to 
ublication. 


Dr. Metson and Miss Macartney are at the Post Office Research Station. 
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Fig. 1.—Typical temperature characteristic of the conductivity of a 
platinum-core S-type assembly. 


distinction to the vacuum phase. The present Part is concerned 
with the factors which determine the level of solid conductivity, 
its relationship to the level of vacuum conductivity and certain 
of its electrical properties. 


(2) SOLID SEMICONDUCTIVITY IN AN S-TYPE ASSEMBLY 
WITH PLATINUM CORES 


(2.1) Thermo-Chemical Action of Platinum on Alkaline-Earth 
Oxides 


Semiconductivity of the oxide matrix results from its activa- 
tion by an excess of barium metal. This excess may be generated 
by a direct thermo-chemical core-reducing action of the form 


X + BaO — XO + Ba 


or by a dissociation of the matrix under the influence of current 
flow. It is proposed in the present Part to examine only the 
effects of activation arising from thermo-chemical action. 

It is commonly believed that platinum has no useful reducing 
action at 1000°K on the alkaline-earth oxides, but experience 
at Dollis Hill seems to show that this view is incorrect. Con- 
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ventional diodes of the 6D15 type with platinum cores have, for 
example, run under zero current load for six years without 
deterioration of total emission.* 

It seems probable that losses of excess barium from the matrix 
by evaporation and random gas poisoning must have occurred 
during this period and that such losses have been made good 
by a continuing thermo-chemical action. More direct evidence 
is afforded by the following experiment. An oxide-coated 
platinum cathode core fitted with interior heater was suspended 
by a platinum tape at the centre of an evacuated glass envelope. 
After the oxide had been heated for 8 hours at 1 700° K a heavy 
deposit was observed on the cool walls of the glass envelope. 
The outer fringes of the deposit were a series of colourless layers 
detectable only by their demonstration of Newtonian rings. 
These fringe layers were clearly deposits of condensed alkaline- 
earth oxides. Occupying the centre of the deposit was a heavy 
gtrey-black film which disappeared instantly on admitting air to 
the evacuated envelope. If the alkaline-earth oxide is evaporated 
in vacuum at 1700°K from a surface of pure degassed alumi- 
nium oxide, the deposit is almost invisible except for Newtonian 
rings and shows no sign of grey stain. It is assumed that the 
grey stain in the first sequence is alkaline-earth metal resulting 
from a thermo-chemical reduction of the oxide by platinum. 


(2.2) Thermo-Chemical Activation 


Thermo-chemical activation was examined on a standard 
S-type assembly fitted with platinum cores and processed to the 
standard schedule set out in Part 1 of the paper. The level of 
matrix activation, or its concentration of excess barium, is 
identified with the magnitude of the solid conductivity, which is 
measured at 420°K, i.e. nearly 300° below the intercept tem- 
perature, T;,,. This conductivity at 420°K will be used exten- 
sively and referred to as the ‘low-temperature reference con- 
ductivity, G49. It is a dc. measurement expressed in 
micromhos. 

The valve was subjected to the following sequence of events. 
The matrix was set at a conditioning temperature, T,,,, of 
1050°K for 3min, allowed to cool to room temperature and 
then raised to 420°K for measurement of o4 9. The whole 
process was then repeated with the matrix set at a conditioning 
temperature of 1 100° K for 3 min, allowed to cool and measured 
for 049. This sequence was continued until a characteristic of 
0429/Teon Was obtained over a range of T7,,, from 1050 to 
1600°K. The only current passed through the matrix during 
the whole sequence was less than the 100 zA necessary to measure 
the conductivity at 420°K. An example of an experiment 
carried out on such lines is set out in Fig. 2, which shows that 
0429 increased with T,,,, to a maximum at about 1500°K. If 
the reverse process on a descending temperature scale is now 
executed, the characteristic follows a new path, as shown in the 
Figure. This complex behaviour is, in fact, more apparent than 
real and is due to a one-time sintering action which alters the 
physical configuration of the matrix reticule. If, for example, 
the cathode was sintered at the beginning of the experiment by 
holding the matrix for 3 min at 1 600° K, the characteristic could 
be cycled indefinitely between 1 050° and 1 550° K in the reversible 
manner illustrated by Fig. 3. A further sintering occurred when 
Tron exceeded 1600°K and the characteristic was again dis- 
placed. The true form of the o49/T,,., characteristic is therefore 
that given in Fig. 3, which shows that the matrix activation or 
low-temperature conductivity is a direct function of conditioning 
temperature. 

An alternate experimental approach is illustrated in Fig. 4. 


* The type 6D15 diode is described in Part 3 of the paper. It has a 2-watt in- 
directly-heated cathode and a close-spaced wire-grid anode. The total emission from 
the cathode at 1000°K is about lamp/cm2. This value is estimated from a low-tem- 
perature measurement at 700° K, where no electrolytic activation is to be expected. 
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Fig. 2.—Low-temperature reference conductivity as a function of 
conditioning temperature. 


x x <* Conditioning temperature increasing. 
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Fig. 3.—Low-temperature reference conductivity of a sintered 
matrix as a function of conditioning temperature. 


x X xX First run, temperature decreasing. 
ooo Second run, temperature increasing. 
eee Third run, temperature decreasing. 


A standard S-type assembly with platinum cores has been su 
jected to four different temperature conditionings on a risit 
temperature scale. After each conditioning the variation of lo 
conductivity with 1/7 has been drawn and the resulting fami 
shows the manner in which an increasing matrix activity affec 
both solid and vacuum conductivities. The ordinate at 420° 
gives the characteristic of 0439/T2., similar to that of Fig. 3. 


(2.3) Reversibility 


The mechanism of the activating action is thought to be 
thermo-chemical one of the form 


nPt + mBaO —> Pt,O,, + mBa 


This action at the matrix-core boundary is likely to be irreversib 
and should lead to irreversibility in the o49/T,., characteristi 
The fall of solid conductivity with reduction of conditionir 
temperature observed in Fig. 3 will therefore require explanatio 
A preliminary experiment to this end is shown in Fig. 5. 

standard S-type assembly with platinum cores was condition 
for 3 min at 1 400° K under zero current load and then measure 
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Fig. 4.—Conductivity as a function of reciprocal of temperature 
after conditioning at various temperatures. 


eee | 


BO- = 


LOW-TEMPERATURE REFERENCE CONDUCTIVITY, 0)... MICROMHOS 


60) 
a OT= 420k 
40} 
20h fou ee 
fe) 50 Tele) 50 200 350 
TIME, MIN 


Fig. 5.—Stability of low-temperature reference conductivity as a 
function of operating temperature—I. 


IT O49. ‘The assembly was then set at 420° K for 120 min with 
ccasional measurement of 049. The temperature was next 
ised to 1020°K and the valve run for a further 120 min with 
itermittent coolings for measurement of o439. It was finally 
in at 1400°K for 30min with checks for a4) every Smin. 
he characteristics shown in Fig. 5, taken in conjunction with 
lose of Fig. 3, bring out the following points: 


(a) Between 1020°-1550°K the solid conductivity, 429, is a 
function of conditioning temperature, Ton. 

(6) At 420°K the system is ‘frozen’ and 0429 is stable with time. 
This has been found to be true for testing times up to 200 hours. 
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It follows from the above that the system ‘freezes’ somewhere 
between 420 and 1020°K and that o459 is invariant with time 
below this ‘freezing’ temperature. The results of a search for 
the ‘freezing’ temperature are shown in Fig. 6. A standard 
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Fig. 6.—Stability of low-temperature reference conductivity as a 
function of operating temperature—II. 


S-type assembly with platinum cores was conditioned for 3 min 
at 1400°K and then run for successive periods of 6 hours each 
under zero current load at temperatures ranging from 400 to 
1000°K in steps of 50°. Measurements of o429 were made at 
hourly intervals, and the function was found to be invariant 
with time in the range 400-800°K. To avoid overcrowding 
the graph these measurements have been omitted from Fig. 6. 
Instability was just apparent at 850°K, and thereafter became 
increasingly rapid. After a run in which decay occurs it is, of 
course, necessary to recondition the tube for 3 min at 1400°K 
to regain the common start point. The experiment is a laborious 
one and was carried out once by each author, who arrived inde- 
pendently at the conclusion that freezing occurs in the range 
800-850° K. 

The explanation for the freezing effect is thought to be simple 
and to be in an experiment described in a previous work.! In 
this earlier investigation a conventional form of diode (6D15) 
with a platinum core was used and the stability of the low- 
temperature reference total emission, J7799, was examined as a 
function of the operating temperature, T.,,.. The diodes were 
processed to a high initial level of activation and run in groups 
at different operating temperatures with measurement of [7709 
at suitable intervals. The resulting characteristics of [p799/time 
at constant T,,,, over a range of T,,, are set out in Fig. 7 and are 
directly analogous to the o49/time characteristics in Fig. 6. 
Comparison of the two sets of results show that both low- 
temperature reference conductivity and total emission are stable 
for all operating temperatures below 800°K, but for higher 
temperatures both functions become progressively unstable. The 
cause of the instability of I-39) was shown in the previous work! 
to be due to a flow of excess barium metal from the matrix into 
the platinum core, where it forms a distinctive barium-platinum 
alloy. This loss of activator by the matrix decreases its low- 
temperature reference total emission—and likewise its low- 
temperature reference conductivity. In short, the analogous 
results of Figs. 6 and 7 are due to activator extraction from the 
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Fig. 7.—Stability of low-temperature total emission of a platinum-core 
cathode as a function of operating temperature. 


matrix by the platinum core metal, and this extraction continues 
until barium equilibrium exists between matrix and core alloy. 

The reversibility of the o439/T,,, characteristic in Fig. 3 is now 
explicable. With increasing temperature the reaction rate 
between platinum and the alkaline-earth oxides increases, the 
concentration of excess barium increases, and, in consequence, 
the low-temperature reference conductivity rises. With decrease 
of temperature the barium production rate falls and the over- 
balance of barium left from the previously higher temperature 
flows into the platinum core to form a barium-platinum interface. 
On a nickel-alloy core the same sequence is followed on a rising 
temperature run, but since barium is insoluble in nickel, there 
is no decrease of o429 on a subsequent falling temperature 
run, i.e. the overbalance of barium remains in the matrix and 
the reversible characteristic of platinum shown in Fig. 3 is lost. 
In short, the platinum system has a barium ‘sink’ which is absent 
in the nickel system, and it is this ‘sink’ which gives to the 
platinum system a peculiar advantage in demonstrating thermo- 
chemical activation in a convincing cyclic fashion. 


(2.4) Relationship between Magnitudes of Solid and Vacuum 
Conductivities 


The manner in which the vacuum conductivity changes with 
solid conductivity was easily established, using a standard 
platinum-core S-type assembly. The relationship required was 
that between the solid conductivity, o49, measured at 420°K 
and the conductivity, oj;939, measured at 1020K, which is 
essentially a vacuum conductivity. 

The matrix was first brought into a condition of high activation 
by a 3min run at 1550°K, and then measured for o439 and 


C1020. The temperature was then maintained at 1020°K 
induce a fall in activity by barium migration into the core, 

the two conductivities were measured as functions of time. 
more convenient parameter than oj 979 is its reciprocal—ma’ 
resistance Ry—which may be compared with the characteri 
resistance Rp. Three typical sets of characteristics are gi 
in Fig. 8. 
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Fig. 8.—Variation of Rg/o420. 


Two points emerge from these characteristics, namely 


(a) For lower states of activation corresponding to o429 
50 micromhos the vacuum resistance at 1020°K changes rapid 
with change of 0420. 

(b) For higher states of activation corresponding to 0429 
50 microhmos the vacuum resistance is largely unchanged wi 
changing 0429, and tends towards the characteristic resistance. 


The reason for the existence of the Rp state is now apparer 
the achievement of a quite moderate activation state brings ; 
into the range 16 + 2 ohms, and any further activation leaves 
within this range. 


(2.5) Action of Oxygen Gas on Solid Conductivity 


At 420° K the conductivity of the matrix can be regarded as 
solid-state phenomenon based on the presence of an excess 
the alkaline-earth metal. The current may pass through t 
solid bodies of chains of contiguous particles or may be limit 
to the surface of such chains. An attempt has been made 
examine these two simple views by injecting low-pressure oxyg 
gasinto thesystem. If, at 420°K, excess barium metal is ‘froze 
in the oxide lattice and oxygen atoms are unable to penetra 
the lattice surface, then gas action should be capable of int 
fering only with a surface conductivity. 

The valve for the investigation was rather more elaborate th. 
usual, containing a standard S-type assembly with platinu 
cores, a tungsten spiral coated with barium peroxide as ; 
oxygen generator, and a conventional form (6D15) of ‘oxic 
cathode diode to act as an independent indicator of catho 
poisoning. The whole arrangement was vacuum processed 
the usual way, and the two oxide devices were run into a stat 
state. The temperature of the S-type assembly was then reduc 
to 420°K and its low-temperature reference conductivity co 
tinually observed. The indicating diode was meanwhile operat 
with its oxide cathode set at 1020°K and its anode curre 
monitored. When both systems had settled down, a slow! 
increasing voltage was applied to the peroxide-coated tungst 
spiral and a simultaneous time record taken of 49 for t 
S-type assembly and of anode current in the diode. A result 
set out in Fig. 9, and this can be regarded as typical of a numb 
of measurements on other valves. Decay of the two char 
teristics set in almost simultaneously, and both then fell asympt 
tically toward zero. The pressure of oxygen producing the 
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‘ig. 9.—Action of oxygen gas on solid conductivity of platinum-core 
S-type assembly. 


| 

\ffects was not measured, but is estimated from past experience 
jf diode decay rates to be about 10-*mm Hg. 

_ Comment on the several phenomena mentioned in this Section 
will be reserved for Section 5. 


(3) INFLUENCE OF CORE METAL ON SOLID 
CONDUCTIVITY 


(3.1) Comparison of Core Metals 


' In the previous Section it was suggested that platinum is 
capable of reducing barium oxide and that a continuous stream 
of barium is liberated at the oxide-metal boundary. The rate 
of barium generation must be very small, since careful examina- 
tion of platinum-cored oxide cathodes which have run for 
50000 hours at 1 020° K show no signs of an evaporated deposit 
of barium metal on the glass envelope. In contrast, similar 
valves fitted with the conventional active O-type nickel cores show 
a very marked deposit of metallic barium* in 3000 hours of 
running. The rates of generation in the two cases are thus 
significantly different. 

It is proposed in the present Section to examine the influence 
of a change in barium generation rate on the magnitude of solid 
semiconductivity as measured in the standard S-type assembly. 
To provide a graded rate of generation, the core metals used 
were: pure platinum, nickel with 4% tungsten, and O-type 
nickel containing 0-05°% magnesium and silicon. The three 
metals are regarded as feeble, moderate and strong in their 
respective barium generation rates at a common temperature. 

Three sets of valves with the three different core metals were 
made up to the standard specification and subjected to the stan- 
dard vacuum processing. Each was then brought into the state of 
sharacteristic resistance by thermal means,{ and its variation of 
log-conductivity with reciprocal of absolute temperature was 
measured. The results are set out in Fig. 10 in the form of four 
snvelopes—a common one for all of the vacuum conductivities 
and a separate one for each of the three solid conductivities. 
That all of the valves have roughly the same vacuum conductivity 
follows, of course, from the concept of the characteristic resis- 
lance, but the difference in magnitude of the three solid con- 
ductivities is perhaps surprisingly large. In round figures, at 
420° K the ratios of the solid conductivities of platinum, nickel- 
tungsten and O-type nickel are 1 : 10 : 100. 

* The film is recognized by its grey colour and its instant disappearance on opening 
he tube to air. Films of nickel and magnesium are similar in appearance but are 

on admission of air. 


t+ The characteristic resistance of an S-type assembly was defined in Part 2 of the 
saper (Section 1.3). 
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Fig. 10.—Influence of core metal on the magnitude of solid 
conductivity. 


The black area represents the vacuum conductivity phase for all core metals. 
The shaded areas represent the solid conductivity phase for (a) active nickel, 
(6) tungsten nickel, and (c) platinum. 


(3.2) Action of Oxygen Gas on Solid Conductivity with 
Active Nickel Cores 


The action of low-pressure oxygen gas on the solid conductivity 
of the S-assembly with O-type nickel cores was examined in the 
same way as that described in Section 2.5 for the platinum-core 
case. To avoid the possibility of nickel oxidation, however, the 
temperature at which the conductivity was observed was reduced 
from 420°K to the room temperature of about 300°K. The 
result set out in Fig. 11 is generally similar to the platinum case. 
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Fig. 11.—Action of oxygen gas on solid conductivity of active 
nickel-core S-type assembly. 


The delay in onset of conductivity decay may possibly be due to 
a slower rate of penetration of the gas into the S-assembly at the 
lower testing temperature of 300°K. The pressure of the gas is 
again likely to be around 10-4mm Hg. 
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(4) ELECTRICAL PROPERTIES OF THE SOLID 
CONDUCTIVITY 


(4.1) Temperature Coefficients 


It will be apparent from Fig. 10 that, apart from different 
magnitudes of solid conductivity, the three core types have 
noticeably different temperature coefficients. The magnitude of 
the coefficient is conveniently expressed as an activation energy 
in electron-volts and it appears to decrease with increase of 
chemical activity of core metal. Numerical values are given in 
Table 1. 

Table 1 
CHANGE OF LOW-TEMPERATURE ACTIVATION WITH INCREASE 
OF CHEMICAL ACTIVITY OF CORE METAL 


Mean activa- 
tion energy 


Number of 
samples 


Chemical 


Core metal Agana 


Spread 


eV 
0-23 
0-15 
0-11 


Platinum ; 
Nickel-tungsten 
O-type nickel 


Feeble | 23 
Moderate | 20 
Active | 


The activation energy quoted by Loosjes and Vink? for a core 
metal which appears similar to O-type nickel in composition is 
0:09 eV, which compares well with the value of 0:11 eV given in 
Table 1. 


(4.2) Current/Voltage Relationship 


The current/voltage relationship of the solid semiconductor 
has been measured with considerable care at 420°K. To ensure 
both short- and long-term constancy of temperature, the supplies 
to the two core heaters were taken from a high-grade d.c. 
stabilized power-pack. 

A first example of a characteristic covering the range 0-300 uA 
is set out in Fig. 12, where a straight line from the origin has been 
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Fig. 12.—Current/voltage characteristic measured at 420° K—I. 


drawn through the experimental points. The characteristic is 
obviously ohmic. A detailed search at 0-1-volt intervals over 
the range 0-100 A confirms this ohmic character and shows 
that, in the absence of thermo-electric potentials due to small 
difference in core temperature, the characteristic always passes 
through the origin. 

A second example taken over the range 0-1 200 pA is set out 
in Fig. 13. In this case the maximum power, VI, developed in 
the system at V = 18-0 volts is 21 mW and this is sufficient to 
raise the matrix temperature by about 4°K. This temperature 
increase leads to a rise of conductivity which must result in a 
departure from linearity of the recorded characteristic. The 
difficulty of power dissipation can be avoided by using a pulse 
technique and allowing the system to cool after each application 
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Fig. 13.—Current/voltage characteristic measured at 420° K—II. | 


Measured with short pulses. -——— From continuous measurement. 


of voltage. The principal characteristic in Fig. 13 was obtaine|) 
using a lsec voltage application followed by a 1 min coolinj 
period, and the result is linear over the whole current rang¢ 
The dotted line shows the result of a continuous application 
voltage and the departure from linearity becomes apparent fa) 
currents greater than 600 uA. The movement from one chara¢ 
teristic to the other is quite slow and the pulse technique 
therefore easy to apply and apparently trustworthy. 


(4.3) Electrical Stability of Solid Conductivity with Time 


The electrical stability of 429 with time has been investigate; 
under various current loadings. The measurement requires ] r 
high degree of matrix-temperature constancy, and this has beeil 
provided. A typical set of results for a platinum-cored S- 
assembly is shown in Fig. 14, in which the value of 439 is 
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Fig. 14.—Resistance/time characteristics at 420°K. 


out in reciprocal form as ohms. The 100-hour test is carrie¢ 
out with a current of 300 uA and an applied voltage of 5-0 volts 
and the longer-term test at 1200 wA with a voltage of 18-0 volts, 
Both results show a high degree of electrical stability of lo 
temperature conductivity with time. The stability of the system} 
at 18-0 volts is interesting, since this impressed voltage is well) 
above the ionizing potential (about 13 eV) of the common gases. 


(4.4) Mechanical Stability 


In addition to electrical stability, the solid conductivity at 
420° K shows,a high degree of mechanical stability. Since the 
matrix consists of a large number of widely dispersed particles 
sintered together at point contacts to form a filamentary reticule, 
the overall mechanical strength is perhaps surprising. An 
example of a test conducted at 420°K on a standard S-type 
assembly with nickel-tungsten cores is shown in Fig. 15. A 
potential of 1 volt is held constant across the matrix and shows 
an initial value of 049 of 165 micromhos. The test is carried 
out by striking the glass envelope of the assembly successively 
in four different quadrantal directions in cyclic fashion and 
observing the conductivity after each impact. By this means 
any preferred direction of applied force is sought out and 
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Fig. 15.—Mechanical stability of solid conductivity at 420° K. 


xamined repeatedly. The impact is applied to the envelope 
yy an ordinary lead pencil swung to give a sharp rap. After 
‘| small initial decrease in conductivity the system shows little 
 thange over some 30 or more impacts. 


(5) SUMMARY AND COMMENT 


| | The experimental observations of the present part may be 
,. juMMmarized in the following manner: 
o | 


(a) The low-temperature reference. conductivity, o420, of a 
platinum-cored S-type assembly increases with increase of con- 
.. ditioning temperature, T-o,. There appears to be a definite relation- 

"ship of the form 6429 = f(T con), and equilibrium is achieved rapidly 
above 1 200° K. 
(b) For the same operating temperature the function o429 increases 
’ with increase of ability of the core metal to reduce the alkaline-earth 
oxides chemically. 

(c) There is a definite relationship between the high-temperature 
vacuum conductivity, ©1029, and 0429. As 06429 progressively 
increases, 01929 makes an asymptotic approach to the reciprocal of 

the characteristic resistance. 
| (d) The low-temperature conductivity is rapidly and almost 
})| wholly destroyed by oxygen. This deactivation has been observed 
| in the temperature range 420-300° K 
| (e) At 420°K the solid conductivity is observed to follow an 
_ accurately ohmic relationship over a wide range of applied voltage. 
| (f) The electrical characteristics of the solid conductivity at 
| 420°K are observed to have a high degree of constancy with time 
over a wide range of power loading. 
| (g) The solid conductivity at 420° K shows considerable stability 
_ under a condition of repeated mechanical shock. 


In the work on the platinum-cored S-assembly leading to the 
| elationship F420 = f(T.on) it is suggested that the activation 
‘eading to the increase of solid conductivity with increase of 
“\emperature is due to a chemical reducing action of platinum 
i) metal on barium oxide. This.is supported by the evidence of 
Section 2.1, and is a convenient idea which enables us to include 
y platinum with nickel-tungsten and O-type nickel as purely thermo- 
i) shemical activators. It is, however, equally possible that the 
durely thermal activations of the platinum-core cases in Figs. 3 
“jand 4 are due to a thermo-dynamical action involving only the 
‘natrix wherein the oxide lattice comes into equilibrium with a 
definite concentration of excess barium for any ogg tem- 
erature. The experimental work of Moore and Allison,? who 
successfully activated barium-strontium oxide on a chemically 
“neutral base of magnesium oxide lends support to this alter- 
, lative view. 

/)| The second point of comment concerns the rather surprising 
» vulnerability of the solid conductivity at low temperature to 
|) Ow-pressure oxygen attack. It had been anticipated by the 
‘wthors that the solid conductivity, supposedly dependent on 
‘he excess barium atoms uniformly distributed within the crystal 
)attice of the solid matrix particles, would be largely unaffected 
|) oy the oxidation of the relatively few excess barium atoms lying 
s}on the solid particle surfaces. There is, for example, likely to 
}e only one excess barium atom lying on the surface of a typical 
! VoL. 107, Part C. 
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matrix particle for every 10000 contained in the interior. 
Observation confounds anticipation, however, and, assuming an 
absence of lattice diffusion at room temperature by both oxygen 
and barium, it would seem that the solid conductivity is largely 
a surface phenomenon. 

A last point of comment will be made in respect of the form 
of the logo versus 1/T characteristic in the solid conducting 
phase. The publications of at least six workers have recorded 
the characteristic as a linear one and the results of the present 
Part are in accordance with this view. In a recent publication, 
however, Higginson* suggests that the solid conduction may be 
more complicated and that the characteristic below 600° K con- 
sists of two distinct linear portions with different activation 
energies. As a result of this suggestion the authors have made 
a careful recheck of the characteristic in the range of temperature 
from 600° K down to room temperature, but can find no evidence 
of any systematic discontinuity. Typical results for a group of 
active nickel-cored S-type assemblies are set out in Fig. 16, 
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Fig. 16.—Extended measurements of low-temperature characteristics 
of conductivity. 


wherein deviations from linearity are regarded as being due 
solely to experimental error. One of the authors has on several 
occasions observed non-linearity of the characteristics in 
platinum-cored S-type assemblies when the resistance of the 
system at 1020°K is well above the characteristic resistance. 
This phenomenon has been ascribed to the movement of negative 
ions of gas within the matrix pores, and it disappears when the 
system is brought into the minimum resistance state by thermal 
means (i.e. the gas has been expelled). The system used by 
Higginson is such as the authors would expect to be liable to 
gas contamination of the matrix. The case for a single-pheno- 
menon view of the solid conductivity is therefore retained by 
the authors. 
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AN OPTIMUM RATIO OF COPPER LOSSES AND IRON LOSSES FOR A 
TRANSFORMER WITH VARIABLE LOAD 


By G. S. BROSAN, Ph.D., B.Sc.(Eng.), and D. O. BISHOP, Ph.D., B.Sc.(Eng.), Members. 


(The paper was first received 3rd April, and in revised form 18th June, 1959. It was published as an INSTITUTION MONOGRAPH in 
November, 1959.) 


SUMMARY 


When a power transformer is loaded so that it is operating at 
maximum efficiency the copper losses are equal to the iron losses 
The ‘efficiency’ is, of course, the ‘power efficiency’, i.e. the ratio of 
output power to input power. The same condition applies for maxi- 
mum ‘energy efficiency’ only if the load is constant for the period 
being considered. 

With most transformers the load varies considerably over a load 
cycle and the optimum ratio of copper losses to iron losses calculated 
on the assumption of constant load for a given period can only be, at 
least, a rough approximation. 

The paper suggests a method of obtaining a more accurate optimum 
ratio in cases where the load variation of the transformer can be 
estimated in advance. The assumption is made that the copper resis- 
tance is constant. 


LIST OF SYMBOLS 
Pp = Load on transformer, per unit. 
P; = Full-load iron loss, per unit. 
P. = Full-load copper loss, per unit. 
t = Time, 
7” = Efficiency. 
P,, = Average per-unit load over a given period. 
Pr =R.MSS. per-unit load over a given period. 
Nm = Maximum efficiency. 
Nme = Maximum efficiency based on the assumption of con- 
stant load. 
P = Constant per-unit load on transformer. 
W ) = Energy output over a given period. 


(1) DERIVATION OF OPTIMUM RATIO FOR ANY 
VARIABLE LOAD 


If we consider a transformer delivering a variable per-unit 
load, p, over a period t,, the energy output over this period will be 


ty 
0 


The energy absorbed in iron losses will be ° 


th 


P; | dt 
0 
The energy absorbed in copper losses, assuming constant resis- 


tance, will be 


ty 
P.| pdt 
0 
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Thus the energy efficiency of the transformer over this peri 


will be 
ty 
[ pdt 
~ 0 


4 


uf] = ty 7 ty ty 
| pat + P, | dt +P, | pdt 
0 0 0 


For an extreme value, = ==)! 


Differentiating with respect to p and equating to zero, ° 
have* 
ty 2 ty 


oT Ta ate NY 


is the average load over the given period, i 


[iat 


the average height of the load curve. This is P,, (per uni 
pdt 
0 

ty 
I dt 
0 
against ¢ over the given period. 

Let Pp = Square root of the average height of the curve 
(p”, t) over the given per-unit period. This is | 
r.m.s. value of the load over this period (per un 


is the average height of the curve of p? plot 


Then 
ty 
pdt 
0 Lae: 
ty =F R 
dt 
0 
Thus eqn. (2) becomes 
—P; + 2P.P2, — PPS = Pee re os 
from which 
P: 
a == 2P2, —) P2 


Hence, if the variation of p over a load cycle is known or | 
be estimated, P,,, and Pr can be found (graphically or otherw 
and the optimum ratio of iron loss to copper loss can 
determined. 


* Epwarbs, J.: ‘A Treatise on the Integral Calculus’ (Macmillan, 1930), p. 36 
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(2) SPECIAL CASE OF CONSTANT LOAD 

| When the transformer load p is constant 

| Py =Pr=P 

‘Substituting in eqn. (4), we have 


(5) 


But P,P? is the copper loss when the per-unit load is P, and 
thus eqn. (5) gives the commonly used relationship for maximum 
\efliciency, i.e. copper losses equal iron losses. 

| 


| (3) VALUE OF THE MAXIMUM EFFICIENCY 


From eqn. (1) ; 
| i pat | dt 
n= 0 0 


ty 
{ pat [ pat 
7S ae a : t 
dt dt 
7 i 
| pe 
Poe eps PUP? 


| I; 
| 3 =F Ot) ==) PP? 
| 


c 


(6) 


For maximum efficiency 
| P; = (2P2, — PRP, 
‘Substituting in eqn. (6), the maximum efficiency is 
| Ms Pe ms 1 
pop. PP, — P2P, + PPh” 1 2P,P, * 


(7) 


(4) PRACTICAL CONSIDERATIONS 

It is usual, when estimating the most desirable loss ratio, 
P,/P., to assume that the load, P, on the transformer, is constant 
‘over the load cycle and has the value P,,,. 

__ In other words, from eqn. (5), 

(8) 
| Cc 

Substituting in eqn. (6), the maximum efficiency when this 
assumption is made is obtained. Thus 


Ul SEREAAP 
| oe arre roe Prk 


ne te teem) SO) 


where 
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It is interesting to compare the maximum efficiency with the 
more accurate one given by eqn.(7). Thus, from eqn. (7) and (9), 
1+ kP, 
Li 2B Rs 


Tne 
me 
and 7,, will be greater than 7,,, if k is greater than 2P,,,. 


(10) 


Si Pi 
i.e. if PA, 
Pn 


or Ler e er. 


It can be shown that the r.m.s. value of any curve is always 
equal to or greater than its mean value. Therefore Pp is always 
equal to or greater than P,,, and thus 7,, is always equal to or 
greater than 1n¢. 

In other words, use of the relationship given in eqn. (4) will 
always result in an improved maximum efficiency when loads are 
varying. 


(5) EXAMPLE OF A SINUSOIDAL LOAD CYCLE 
Some indication of the order of the improvement in efficiency 
is gained by considering a load cycle which varies sinusoidally 
from 0 to z. 


Here P,,, = 2/a per unit 

1 é 
and Pr = V2 per unit 
Thus RemepP see UE e142 


Hence, from eqn. (10), 


tm _1+1-42P, 
Nie yl ad QI7P) 


Assuming that the full-load copper loss is 0:03 per unit, 


it fA) 5 
Nine 
There is thus a gain of 0-5°% in efficiency by using eqn. (4) to 
determine P;/P... 


(6) CONCLUSIONS 


The theory is, of course, approximate because no account is 
taken of the variation of copper resistance with temperature. 
It also ignores the relative prices of copper and iron. Neverthe- 
less, if the load cycle of a transformer is known, or even if it can 
only be estimated approximately, it will always be advantageous 
to use the ratio of iron loss to copper loss given by en. (4) in 
preference to the more usual one, based on the assumption of 
constant load. 

The saving in energy over a period may be considerable, 
particularly if the transformer is large and the form factor of 
the load curve is high. 
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By L. SOLYMAR. 


(The paper was first received 6th December, 1958, and in revised form 10th June, 1959. It was published as an INSTITUTION MONOGRAPH 
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SUMMARY 


The optimum design of non-uniform transmission-lines for a given 
bandwidth is discussed, and it is stated that the shorter the non-uniform 
transmission line the more violently its characteristic impedance 
varies. The concept of complexity of this characteristic impedance 
function is introduced and a design method is suggested for minimizing 
the complexity. 

Numerical examples are given to illustrate the method, and it is 
shown that it is possible to design tapers even shorter than the 
Chebyshev type. 

The method is capable of further extension. 


LIST OF SYMBOLS 
x = Co-ordinate in the direction of propagation. 


V(x) = Voltage along the line. 
I(x) = Current along the line. 

5 _ VO)/IX) — Zo) 
p(x) 


VU) -F Zo(x)" 
Zo(x) = Nominal characteristic impedance. 
y = a + j8 = Nominal propagation coefficient. 
= Nominal attenuation coefficient. 
a Nominal phase-change coefficient. 
Zo, Zo2 = Characteristic impedances of the uniform 
lines connected by the non-uniform line 


(Fig. 1). 
1 = Length of the taper. 
d 
F(x) — 2 Gloss Zo(x)]. 
u = Bl, y = 2x/l = Variables defined by eqn. (5). 
f(y) = A function defined by eqn. (5). 
p(—3/, u) = Reflection coefficient at the point x = — 41. 
A = An interval of the u-axis. 
© = Maximum permissible value of the reflection 
coefficient in the pass band. 
B = Complexity of the nominal characteristic- 
impedance function. 
A, = Lagrange multiplier. 
K, x, Co, Cy = Indefinite constants. 


(1) INTRODUCTION 


One of the principal uses of non-uniform’ transmission lines 
is for broadband matching of unequal impedances, e.g. two 
transmission lines of unequal characteristic impedance. Usually, 
it is required that the reflection coefficient of the taper shall fall 
within predetermined limits over a required bandwidth. 
Recently a number of solutions have been produced aiming at 
an optimum engineering design. Ifa taper is required to have a 
theoretically infinite bandwidth, the Chebyshev taper is 
undoubtedly the optimum design!>? (i.e. has minimum length); 
but from an engineering aspect it cannot be the best, since an 
infinite bandwidth is never demanded. We therefore ask the 
question, What is the optimum taper design if the bandwidth is 
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engineering design. 
Without going into the intricacies of the mathematics it caj 
be stated that an eae taper design cannot exist, for, in 


problem are concerned, it is nevertheless possible to desigi, 
tapers which are significantly shorter than the Chebyshev tape} 
and yet of simple shape, and the paper indicates the cay 
procedure. 


(2) THEORETICAL FUNDAMENTALS 

From the familiar transmission-line equations it is found thai 
the function 

— VOIX) — Zo(X) 

V(x)/U(x) + Zo(x) 


satisfies the first-order non-linear differential equation? 


dp Ml: »d [loge Zo(x)] 
de VR lee Pe 


=0 . ..@ 


If V(x)/I(x) does not differ appreciably from the nomina 
characteristic impedance, p?<1. This assumption is per 
missible if the line parameters change slowly and the ratic 
between the characteristic impedances of the two uniform lines 
Zo2/Zo1, is not too large. Introducing the notation 


1 [logs Zo(x)] 
2 dx 

gives a first-order linear differential equation* in p. The boundary 
condition (Fig. 1) at x =H is VGD/IG) = Zo = ZG) 


, 


Zoe 7 f 


F(x) = @ 


Z9 (x) 


BY 


O1 


UNIFORM LINE 


NON-UNIFORM! UNIFORM Line 
LINE 


1 


X=5l xX=0 X=h1 


Fig. 1—\Non-uniform transmission line section. 


i.e. p = 0. Under these conditions the differential equation (1 
gives 
1/2 


pe = [ F(z) exp | -2 i Gale | es (3 


Neglecting losses, y(x) = j8(x), a pure imaginary number 
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| 


"0 frequency. 


|| For x = — 4/ eqn. (3) gives 
be | ’ 
p(—H) = eV! | Foye Pedy (4) 
| ‘Substituting 
v= a Bl = u; FGyl) = f(9); p(— 4, ) =T@le*! . 6) 
sives 
1 
Pw) =4 | yey © 
' Inverting eqn. (6) (apart from a constant factor) gives 
Tene i Tweidu (7) 


i The problem is to determine f(y), and hence Z)(y), for any 
“specified Iu) function. Obviously the function ['(u) must 
“satisfy the following requirements: 


(a) Its Fourier transform must vanish outside the (—1, 1) interval. 

(6) Its absolute value must remain within prescribed limits, +0, 
in a given interval A. 

(c) At zero frequency its reflection coefficient, (0), is determined 
solely by the characteristic impedances Zo2 and Zo1, irrespective of 
the nature of the non-uniform transmission line. 


Before proceeding with the solution of the integral contained 
in eqn. (7), subject to the above requirements, it must be stressed 
that the problem is not simply the determination of the Fourier 
‘transform of Iu). For in the first place Iu) is not known or 
given as a function of u, but what is given is that I(w) must 
‘belong to a class of functions satisfying the above requirements. 

Similar problems have been encountered in connection with 

the determination of radiation patterns of certain aerials>»° and 
the mathematical approach will be very similar; the details of 
‘the analysis will therefore be omitted. 
_ In principle the above requirements could be met for an 
‘arbitrary (but finite) interval A lying arbitrarily near to the u = 0 
‘point and for arbitrary small 5. Whether the resulting non- 
uniform line is a physically realizable structure is another 
question, and it is safe to say that, other factors being equal, the 
smaller 6 the more wavelike f(y) will be. Moreover, if the f(y) 
function is rich in high-frequency Fourier components, the 
approximation (set out at the beginning) of neglecting the 
higher-order modes is no longer valid. However, judicious use 
of the method to follow will lead, in most cases, to a satisfactory 
solution. 

From what has been said it will be obvious that an optimum 
non-uniform transmission line does not exist, since for a given 
length of line, /, there is an infinity of possible f(y) functions, all 
of which satisfy the requirements (a), (b) and (c). 


(3) OUTLINE OF THE NEW APPROACH 


The function attribute called ‘complexity’-is now introduced 
and is defined as follows: It is proposed to call the function 
shown in Fig. 2(a) ‘simple’ and that in Fig. 2(b) ‘complicated’. 
It will be observed that large values of |Z¢(y)| in the interval 
(— 1, 1) are peculiar to a function having large complexity and, 
in general, vice versa. Consequently, the integral of the square 
of the second derivative taken over the interval (— 1, 1) could 
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» Further assume that B(x) = B, independent of x but proportional 
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Fig. 2.—Diagrams pertaining to the concept of function complexity. 


be taken as a measure of complexity. It will therefore be 
convenient to define complexity, B, as follows: 


1 
B=| [Z6()}ay (8) 
The physical problem can now be stated as follows. It is 


required to design a taper [Z(y) function] satisfying requirements 
(a), (6) and (c) and at the same time exhibiting minimum 
complexity. 

Mathematically we seek a function, Zo(y), which satisfies the 
conditions (a), (b) and (c) and minimizes the expression (8). 
From egn. (2) 


y 
Zo(y) = Kexp E fir @) 
where K is an arbitrary constant to be determined. 
Since the exponential function is monotonic, a good approxi- 
mation is obtained by considering only the exponent 


a4 
| tore (10) 
The expression to be minimized is therefore 
1 y mye 1 
B, =| {[J fond | \ dy =| [f’(») Pay (11) 
= =4 =1 


For simplicity, assume that I(u) is a real function, so that f(y) 
is even. Consequently, the Fourier transforms will contain 
cosine terms only, and this will simplify further calculations. 

The difficult part of the analysis is to satisfy the requirement 
(b), since it is impossible to put it in a form amenable to analysis. 
However, the difficulty can be circumvented by a number of 
procedures, as follows.* 

In the first instance one could, for example, prescribe the 
zeros of the I'(u) function at points u = ug, (kK =1,2...n), Le. 


1 
[ £009 cos uenydy = 0 (k = 1,2...1) (12) 
0 


Clearly, if the density of zeros in the interval (—1, 1) is 


* It must be realized, however, that the approach need not lead to the best solution, 
since the auxiliary conditions to be stated are not an exact statement of physical 
requirements. 
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increased, there is a strong likelihood for the I\(u) function to 
remain small in the whole interval. 

As an alternative procedure, the values of the function and its 
derivatives could be prescribed at points u = up, (k = 1,2...m). 
Let the value of the function at these points be 1/D,; of (0) (the 
value at zero frequency), and choose at the same points the 
derivative of the function to be zero. These conditions can be 
put in the form 


TO) 


1 
ik £(3) €08 uyiody = (k= 1,2. ey 


(13) 


1 
and [2 sin uz,ydy = 0 
0 
It would also be possible to assume, at any point u,, the first g 
derivatives to be zero. There would then be, in addition, 


1 
i yPf(y) sin u,ydy =0 (p odd) 
0 


ora IA) 
and ! 
[ y'f(y) cos u,ydy =0 (p even) 
0 


Evidently the function Z9(y) can be made to satisfy some or 
all of these conditions; it would also be made to satisfy any 
other suitable conditions. Clearly, the problem belongs to the 
class soluble by the calculus of variations, with the auxiliary 
conditions in the integral form [e.g. eqn. (12), (13) or (14)]. 

However, before the method is applied successfully, it is 
necessary to gain a certain amount of experience and skill. 
For, if the auxiliary conditions are chosen unskilfully, without 
any regard to the shape of Zo(y) curve, the resulting taper, 
although it will satisfy the bandwidth requirement, will be too 
complicated in shape (rich in high-frequency Fourier com- 
ponents) to be of any practical use. For any practical case one 
must therefore seek a compromise between the complexity of the 
Zo(y) function and the bandwidth. 

To illustrate the method, the auxiliary condition given in 
eqn. (12) is used. The problem is then formulated as follows: 

Required 


1 
[ [f(») dy = Minimum (15) 
0 
subject to the auxiliary condition 
1 
ie 608 FO 12 ae) (16) 


Tt can be shown that the necessary condition for the existence 
of the solution is that the function f(y) satisfies the Euler— 
Lagrange differential equation (8). (Whether or not the con- 
dition is sufficient will be obvious from physical reasoning.) 
The appropriate Euler-Lagrange differential equation for this 
case reads 


ni ae (Pr + A,£(y) cos uo} 


d n 
dy of’ cel 6) ba 2 p> A f(y) cos ust 250) (17) 
where the A,’s are Lagrangian multipliers. 
This reduces to 
f(y) = : Dy Ay, COS Upy (18) 
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Integrating and introducing the new variables p, instead | 
£X,/uz gives 

f(y) = by py, COS Uy + Cyy + Co aM, (al 

The unknown py, are determined with the help of eqn. (iq 

while the constants Cp and C, are determined by the followii) 

procedure. 


The application of the boundary conditions to the nomin| 
characteristic-impedance function yields 


Z(y¥) = Zo, ify = —1 } ( 
Zo(¥) = Zo2 ify = 1 

Substituting in eqn. (9) with K = Zo, gives 
1 fi canis wey . 
3 fCyydy = 7 108. me oy Cee eG ; 


which is a good approximation to the zero-frequency reflectic 
coefficient provided that Zy>/Zo, is less than 2 (see Reference 2 

The integration constant C, can be determined in a number | 
ways, and the logical procedure is to determine it by minimizir 
the expression 


1 
BC) = | [Pay aides cue 


In the particular application, C; = 0, for simplicity. 


(4) AN APPLICATION OF THE METHOD 


To be able to compare the numerical results with tho: 
obtained by different methods and already published elsewher 
it will be convenient to choose [\(0) = 0:2. 

Let the first zero of the I'(w) function be at 7, as in the cai 
of the exponential taper. If the second zero occurs at 32/2 an 
the method described above is applied (details of the calculatic 
can be found in the Appendix), we obtain a curve as shown i 
Fig. 3. It will be observed that in the interval 2:82-5:12 tk 


Fig. 3.—Absolute value of the reflection coefficient |['(w)| as tt 
function of u. 


Present design. ——— Chebjshev taper. 


reflection coefficient is already less than it would have been fe 
the Chebyshev taper having the same first zero. 

If the calculations are continued for various values of tk 
position of the first zero, it is found that, in general, the small 
the first zero the larger will be the value of 5 and the mot 
complicated the Z(y) function . Conversely large values of th 
first zero are associated with a long taper, small bandwidth 
small values of 6 and a simple Z)(u) function. 
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| Fig. 4.—Complexity, bandwidth, and I(0)/6 for the taper (D) as a 
function of the position of the first root of the function; for 
comparison, I'(0)/6 for the Chebyshev taper (T) is also plotted. 


a 


__ Fig. 4 shows the magnitude of [(0)/6, complexity and the 
| bandwidth as a function of the position of the first zero. The 
| results are contrasted with the Chebyschev taper, which, of 
| course, has infinite bandwidth. It transpires from this Figure 
| that for a given value of 5 the new taper is substantially shorter 
| than the Chebyshev taper. It can be said that, with the method 
| outlined, a reduction in length of some 10-30% is easily achiev- 
| able. It must be pointed out, however, that this taper has a 
| finite bandwidth of 20-30%. Nevertheless, this is quite adequate 
| for most practical applications, but, if désired, there is no reason 


=O2 fe) 0-2 


0-4 06 o's 


Fig. 5.—Variation of the normalized taper impedance, Zo(y)/Zo1, as 
a function of y, the normalized taper co-ordinate. 
(a) fore = 0:5. 
(b) fore = 0. 
(c) fore = — 0-4. 
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why the computations should not be carried further and an even 
better result obtained. On the other hand, although mathe- 
matically the Chebyshev taper has an infinite bandwidth, in 
practice this is not so for a number of reasons, e.g. the junction 
discontinuities have been neglected and the higher-order mode 
effects disregarded. 

The resultant Zo(y)/Z, function of the new taper is shown 
in Fig. 5 for three different taper lengths [positions of the first 
zero of the I(u) function]. 


(5) CONCLUSIONS 


A method based on the variational calculus has been developed 
and applied to the design of transmission-line tapers. The 
validity of the method is subject to the following approximations 
(in common with all other known methods): the effect of higher- 
order modes is neglected; the ratio of voltage to current in the 
line does not differ appreciably from the nominal characteristic 
impedance (this implies that p < 1); the nominal phase-change 
coefficient, 8, is a constant which is the characteristic of the line. 

In applying the method to practical cases, two quantities must 
be given, namely the bandwidth and the maximum permissible 
value of the reflection coefficient, 5, within it. 

The tapers designed by this method have finite bandwidth, 
but this, as explained above, is no limitation of the method. 

In principle, the maximum permissible value of the reflection 
coefficient, 6, in a given bandwidth could be made as small as 
desired, but the resulting taper would be very complicated and 
then the assumption about neglecting the higher-order modes 
would be invalidated, and in any case the taper would be too 
complicated to be of any use. 

It is also concluded that an optimum design does not exist. 


(6) ACKNOWLEDGMENTS 


The author wishes to thank Mr. L. Lewin and Dr. A. E. 
Karbowiak for reading the manuscript. 

Acknowledgment is also made to Standard Telecommunica- 
tion Laboratories, Ltd., for permission to publish the paper. 


(7) REFERENCES 


(1) KLopFENSTEIN, R. W.: ‘A Transmission-Line Taper of 
Improved Design’, Proceedings' of the Institute of Radio 
Engineers, 1956, 44, p. 31. 

(2) Couns, R. E.: ‘The Optimum Tapered Transmission Line 
Matching Section’, ibid., p. 359. 

(3) WALKER, L. R., and Wax, N.: ‘Non-Uniform Transmission 
Lines and Reflection Coefficients’, Journal of Applied 
Physics, 1946, 17, p. 1043. 

(4) BoLinpeER, F.: ‘Fourier Transforms in the Theory of Inhomo- 
geneous Transmission Lines’, Proceedings of the Institute 
of Radio Engineers, 1950, 38, p. 1354. 

(5) WoopwarD, P. M., and Lawson, J. D.: ‘The Theoretical 
Precision with which an Arbitrary Radiation Pattern 
may be obtained from a Source of Finite Size’, Journal 
LE.E., 1948, 95, Part III, p. 363. 

(6) Kovacs, R., and Sotymar, L.: ‘Theory of Aperture Aerials 
based on the Properties of Entire Functions of the 
Exponential Type’, Acta Physica Academiae Scientiarum 
Hungaricae, 1956, 6, p. 161. 

(7) Taytor, T. T.: ‘Design of Line-Source Antennas for Narrow 
Beamwidth and Low Side Lobes’, Transactions of the 
Institute of Radio Engineers, 1955, AP-3, p. 16. 

(8) WeINstock, R.: ‘Calculus of Variations’ (McGraw-Hill, 
New York, 1952). 


104 SOLYMAR: 


(8) APPENDIX 
The reflection coefficient at u —0 is ['\(0) = 0:2, the pre- 
scribed zeros of the f(y) function are 7 + ¢ and 37/2. According 
to eqn. (19), f(y) can be written in the following form: 


3 
f(y) = py cos (7 + €)y + p2 COS sy + Cp (23) 
For the determination of j11, 4. and Cp we can use eqns. (16) 
and (21). Hence 
yypy + Byoog + by3Co = 9 
bo1p4y + byafhy + by3Cy = 0 (24) 
bsp + b32M@2 + b33Co = TO) 
where 
1 1 sin 2(7 + €) 
== 2 | y : DS 
by, J,cos (a + €)ydy “hi War + 6) | (25) 
: 377 
by — by — [cos (7 =i €)y cos Sey 
he sip eee a Sse COS € (26) 
Px ey BNE 
(7 + 6)? — (F Yi 
b ss pr tien sin € Q7) 
bys = 03, = ce Gunn? ie 
1 37 1 
=> 2a =S -«< 
ay [eos a Idy = 5 (28) 


A NOTE ON THE OPTIMUM DESIGN OF NON-UNIFORM TRANSMISSION LINES 


2 
30 


1 377 
bo3 = 532 = [cos 5) ydy = 


1 
hye je =f (30 

When j1;, [42 and Cy have been determined from eqn. (24) 
the reflection coefficient can be calculated with the aid o 
eqn. (6), whence 


Tu) = 181) + p282(u) + Cog3(u) (31 
where 
1 
gu) = [ cos (7 + &)y cos uydy 
0 2% 
Se ee G2 
(7 Fey : 
1 37 3 COS ul 
go(u) = if cos av cos uydy = ae Ga 
Ae been (8a 
. ( 2 ) 
1 : 
g3(u) = [ cos uydy = pay (34 
0 Uu 


For different values of ¢ the I'(u) curve can now be plotted 
from it the I\(0)/6 ratio and the bandwidth can be determined 
The Zo(y) curve is calculated from eqn. (9), while the complexit 
is obtained from eqn. (11). 
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SUMMARY 


The numbers of positive, real and complex zeros of a polynomial 
with real coefficients are shown to be related to the number of changes 
in sign along one or two sequences of determinants whose elements 
consist of the coefficients of the given polynomial and of its derivative. 
This in turn is related to the number of negative signs occurring in a 
certain continued fraction. Some numerical examples illustrate the 
application of both methods, and the simplification of and the relation- 
ship between the two sequences of determinants is also discussed. 


(1) INTRODUCTION 


In a number of engineering applications it is of importance to 
be able to investigate the character of the zeros of a polynomial 
directly from its coefficients, i.e. to determine the number of 
complex zeros, the number of real zeros and the number of 
positive zeros that the polynomial possesses without actually 
calculating them all first. 

For example, the roots of an algebraic equation might corre- 
spond to the optimum value for a component in an electrical 
circuit and it is desired to know how many positive, or real, 
solutions are possible or whether, in fact, any such exist at all. 
Again, in the synthesis of linear passive driving-point impedances 
the real part of the impedance must be non-negative for all real 
frequencies, and checking whether or not this requirement is 
satisfied for a given function reduces to a consideration of the 
positive zeros of a polynomial. 

The usual mathematical method of investigating these problems 
is to use Sturm’s theorem, details of which are given in most 
books on the theory of equations.!»2 However, Sturm’s theorem 
results in unnecessarily complicated, and hence lengthy, calcula- 
tions if the numbers of positive and complex zeros are all that is 
required, and simpler methods are given in the following Section. 
It is shown that the problem reduces to a consideration of the 
changes in sign along one or two sequences of determinants whose 
elements consist of the coefficients of the given polynomial and 
of its derivative.* This in turn is shown to be equivalent to 
investigating the number of negative signs occurring in a certain 
continued fraction. This last result, stated in Section 2 in the 
form of two theorems on continued fractions, differs from the 
J-type continued-fraction test given by Wall. 

Some numerical examples to illustrate the methods are given 
in Section 3, and the simplification of and the relationship 
between the two sequences of determinants is discussed in the 
Appendix. 


* These determinants are the same as those that have been used, for example, by 
Fuller,3»4 to express the conditions for a polynomial to have all its zeros real and 
negative, and also for a polynomial to have all its zeros real. 


Correspondence on Monographs is invited for consideration with a view to 
publication. F 

Dr. Cutteridge is at the Manchester College of Science and Technology (Faculty of 
Technology, University of Manchester). 


(2) TWO THEOREMS ON CONTINUED FRACTIONS 


Let f@)=a,x" +a, x. "'!+...+ax+a. . (1) 
where a, G,_4,..- @ are real, 
and f’(x) = pened) + (el OS sat +...-4 bo = (2) 
Then, in general, we can write 
TCinn 1 
£G) i G3) 
1x + 
1 
Bi ok oe 
AnX + B a 


1 


1 
CHEE Soa 
k 


where «,, 8; 40 and k =n in the absence of multiple zeros 
of f(x). 


The following theorems will now be proved. 


Theorem 1. 


The number of positive zeros of f(x) is equal to the number of 
negative 8; minus the number of negative «;. 


Theorem 2. 


The number of real zeros of f(x) is equal to (k — 2N) and 
the number of pairs of conjugate complex zeros is equal to N, 
where JN is the number of negative «;. 


Both Theorems 1 and 2 yield the numbers of distinct zeros in 
each case, the multiplicity of zeros not being counted. 

A proof of these theorems can be obtained from Sturm’s 
theorem and most easily from one of the modified sets of Sturm 
function given by Trudi;® details of these functions are also to 
be found in Muir.” 


The functions in question are the set Xp, Xj, Xo, ... X,, 
where 
Xo = XxX, = |a, Qn—1 0 ) 
0 beet 
bn—1 On-2 * 
a Ope deo aeage) O 
0 Bs Qn—1 aos OP lag (4) 
X =|0 0 n—1 bn_2 1 
0 bn bn—2 bn—3 x 
bn—1 bn_2 n—3 bn—4 oe 
etc. J 
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and 


b,_1 = NA, 
IY py es Ui eres 
: ; A | (5) 


bo = ay 


At x = 0, the functions X, become 


5 
X00) = 1 0) = — ae pet 
ay, Gr—1  %n—-2  n-3 
0 Qn G—1  4n-2 " 
X,(0) = (¢) bee baa, by3 | 
b, 1 by 2 be 3 bn 4 
etc. 


2 


Every term X);,, has a negative sign which can be removed 
by rewriting the bigradients in eqns. (6) in a form analogous to 
the Hurwitz determinants. Thus 


Xo(0) = 1 Xi) = A, X,(0) = A 


where A, is the ith-order determinant 


.. X,0) = A, . 


b, i! b, 2 by 3 Dy 4 
Gn GQn—-1 U—2 G-3 
Bey a) b,_3 
ay, Qy—-1 4n—2 
CURL gD Res : 
0) 0 An anes 
0 0 0 bn—4 
The number of changes of sign in the sequence 
Ag A Age Age (9) 
is equal to the number of negative terms in the sequence 
Ay, A,A,, AyAg, .. . Aoz_oAoz (10) 


Using the same notation, the highest powers of x and corre- 
sponding coefficients in the successive functions X; are 


Wa yy Gi Nak, QNeX Gy Nae 


(11) 


Since a, and b,__,; have the same sign the number of changes of 
sign in sequence (11) when x = + ©O is equal to the number of 
negative terms in the sequence 


azb,1A3, aZA,As, aZAsAz, . (12) 


which in turn is equal to the number of negative terms in the 
sequence 


AY AS ai 


D 
: aN y4—-3ANoK—1 


by_1A3, A3As5, AsAq, ... Ag, —3Aox—1 (13) 


The quantities «; and 8; in the continued fraction in eqn. (3) 
have been given by Bader® as 


Qn A? 
a = fae By " Ne 
Uses os 
ie AA; Pa = A,A, - (14) 
Led EN 
Ba gAzi—-1 Ay; Ao; 
sh os ale 


Thus the number of negative terms in sequence (13) is equal 
to the number of negative a terms, and the number of negative 
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terms in sequence (10) is equal to the number of negative fl 
terms. But as the functions X,; are a set of Sturm functions the 
number of sign changes in sequence (9) minus the number oj 
sign changes, when x is positive, in sequence (11) is equal t te} | 
the number of positive zeros of f(x). 
positive zeros of f(x) is equal to the number of negative B terms} 
minus the number of negative « terms, as stated in Theorem 1]” 

Consider now the total number of changes in sign in two} 
sequences A and B, sequence A being an arbitrary sequence ol) 
(k + 1) positive and negative terms and sequence B obtained) 
from sequence A by changing the signs of the 2nd, 4th, 6th, . . j ies 
terms. Evidently the sum of the number of sign changes i | 
sequences A and B must be k. Thus if N is the number of signi 
changes in sequence (11) when x = + © then (kK — N) must 


so the number of real zeros of f(x) must be (k — 2N). As k ig) 
the total number of distinct: zeros, f(x) must have N pairs of 
conjugate complex zeros. | 

But as the number of sign changes, when x is positive, in 
sequence (11) is equal to the number of negative « terms the 
number of real zeros of f(x) is equal to (k — 2N) and the number}: 
of pairs of conjugate complex zeros is equal to N, where N is the y 
number of negative « terms, as stated in Theorem 2. | 


(3) NUMERICAL EXAMPLES © 
(3.1) Example 1 


Take f(x) = x? + 2x? 423 = 2 
then f'(x) = 3x? + 4x +3 (16) 
f’(x) 1 am 
and Oi x i i a7) 
3: 1 Ot 1 
yas 2 1 
Gab w BREE) HSE | 
6 B81 
d 
6 


Thus the number of negative « terms is 1 (out of 3), implying 1 
pair of conjugate complex zeros and 1 (i.e. 3 — 2 x 1) real zero. 
As the number of negative 8 terms is also 1 there is no difference 
in the numbers of negative « and f terms and there are therefore}, 
no positive zeros. Hence f(x) has one pair of conjugate com-| 
plex zeros and one negative zero which may easily be verte 
since | 
f(x) = x3 + 2x2 44+3x 42=(« + I(x? +x 42) 


=(x+)@+12+¢/1D6 +12 =A 
(3.2) Example 2 
To illustrate the case of multiple-order zeros, consider 
f(x) = x4 + 2x3 + 5x? +4x4+4= (7? +x +42) 
= [(% + 1/2 + f/712)(% + 1/2 —j/7/DP 


(18) 


(19)| 


Then f'(x) = 4x3 + 6x? + 10x +4 (20) | 
and +) = aa ee (21) 
eZ x 1 7 

7 3g FSF 


Multiple-order zeros are indicated by premature termination 
of the continued fraction. Details of these could be obtained, 


f required, by inspecting the last divisor in the process of 
btaining the continued fraction. By Euclid’s algorithm this 


In the present example, as one negative and one positive x 
| }oefficient in the continued fraction indicate one pair of conjugate 
eq! ;omplex zeros and no real zeros, the conjugate complex zeros 


(3.3) Example 3 


Th | Eyen when the given polynomial is such that a continued- 
fraction expansion of the same type as in eqn. (3) is no longer 


| Be es eS 1) (22) 
| f” 2 1 
“Then ay Tiger C) 
j i s3 5+ zig 
th} 


i But, as the continued fraction in eqn. (23) is not of the same type 
i as that in eqn. (3), no conclusions can be drawn with regard to 
the character of the zeros of f(x); e.g. it would be incorrect to 
‘say that one negative « coefficient implied that f(x) contained 
| one pair of conjugate complex zeros. 
| Now modify f(x) by adding a term ax. 


| Then 
£09 _ et a 1 
Mf) x2+ox—1 x 1 
2 
2 4 1 
U) aa 1 


aS ae 

4 4 
. 2(a +=) = (a+°) 
| Mtn... (24) 


Let a ->0 and examine the signs of the coefficients in the 
continued fraction. Both « terms are positive, indicating no 
complex zeros and 2 real zeros. Whether a— +0 or —0, 

| there is one negative 8 term and hence one of the real zeros is 
I} positive and the other negative. That this is the correct result 
) is easily verified. 


| (3.4) Example 4 


| This example illustrates the determination of the character of 
| the zeros of a polynomial by an examination of sign changes in 
| sequences of determinants. 

From the argument connected with sequence (11) of Section 2, 
| if N is the number of changes of sign in the sequence 


| By wd, a,A3, a,As, eee AnArK—15 see AnArn—1 (25) 


! with A,,_, the highest-order non-zero determinant in the 
| sequence, then the given polynomial has k distinct zeros com- 
| prising (k — 2N) real zeros and N pairs of conjugate complex 
Zeros. 
For convenience in calculation, the Hurwitz type determinants 
| A,;—; in sequence (25) can be reduced to about half their original 
order by using the condensation procedure described for the 
Hurwitz determinants in a previous paper;? some further details 
: are also given in the Appendix. 
| Thus sequence (25) becomes 


(26) 


* This is one of a number of ways of modifying the polynomial so that the standard 
form of continued fraction results. Alternatively, the non-standard type of continued 
fraction originally produced might be investigated. 


aPn—15 a,D3, a,Ds, eres Q,DoK—1> gs ie G,Don—1 
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where a,b, _, is always positive and D,; ; is the condensed 
determinant equal to A,;_, 

To illustrate this method take the same cubic polynomial used 
in Example 1, namely x? + 2x*+3x+2. Adapting the 
results for the condensed Hurwitz determinants given in 
Reference 9, the highest-order determinant to be considered, 
Ds, is given by 


Gada 1 aDn — Dn 3 
a GQn—2 GQn—-1 U-3 
elesbe: eos petals | 
D ae aah By 1 On ee n—1 bn_3 - 27) 
Andy —3 Gn —1 fo Qn—-2 U-3 
Open bn_3 bn—2 bn—3 


D, is the second-order principal minor indicated by the dotted 
lines in the above determinant and a,b,_, is the first-order 
principal minor. 

For the polynomial x? + 2x” + 3x + 2, 


Gn = 1 Gay = 2 Gg = 3 \Qn-g = 2 
: (28) 
Dees => 3 bn—2 — 4 bn—3 a 3 
The terms @,4, a,—5, etc., and 5,4, b,_5, etc., are zero. 
Substituting these values in eqn. (27), 
3 4 
2 3 2,2 
Dome Weweia 4 B 3 Lo .stog (29) 
; oye ale) A 
Bg! 4 3 


Thus D, is negative, indicating one change of sign in sequence 
(26), and consequently the polynomial x? + 2x? + 3x + 2 has 
one pair of conjugate complex zeros and one real zero, exactly 
as before. 

The number of positive zeros is given by the excess in the 
number of sign changes in sequence (9) over those in sequence 
(26). As before, the determinants in sequence (9) can be con- 
densed to half their original order giving 


ie Den Dae Dy . (G0) 


For the present example the highest-order determinant to be 
considered, Dg, is given by 


Dae bn-3 


be aa 
a, GQn—1 an Q&n—2 Rat are 
De = NB, spe Pnes Beta (31) 
is = 2 3 . 
it be Di 194n—3 ote o a | Pn—2n—3 
n n Gn—1 U2 
Dn = 1Gn_3 bn—24n ee: 


The second- and first-order principal minors indicated by the 
dotted lines in the determinant Dg are equal to D, and Dy, 
respectively. 

Substituting the numerical values, 


261 6,| 
Gy 12-8 
6 8 6 


As D, (=2) is positive there is one change of sign in sequence (30), 
which is the same as the number of changes of sign in 
sequence (26). The polynomial x3 + 2x2 + 3x + 2 has, there- 
fore, no positive zeros, agreeing with the conclusion reached in 
Section 3.1. In this case the result is immediately obvious, since 
the polynomial, having positive coefficients, can never vanish 
for positive x. 


See O) ale: (32) 


108 


(4) CONCLUSIONS 


It is shown that the number of sign changes along one or two 
sequences of determinants can be used to determine the number 
of positive zeros and the numbers of real and complex zeros 
of a real polynomial. Two theorems are developed to link these 
sign changes with the number of minus signs occurring in a 
certain continued fraction. Both results provide useful methods 
for investigating the character of the zeros of a polynomial. 
When using the determinantal method it is advantageous to 
condense the determinants, thus reducing their order to about 
half. This is illustrated by a numerical example in Section 3.4, 
and further details are given in the Appendix. 
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(6) APPENDIX 


The Appendix discusses.the condensation and subsequen 
simplification of the Hurwitz-type determinants A,; and a,A,;_ 
of sequences (9) and (25) and comments on their significance 
in the theory of equations. 

Consider first the case of the determinant A,, of order 2n 
Adapting the condensation procedure of Reference 9, the follow 
ing result is finally obtained: 


D2, aR 
Qn An —1 24;,An—2 3A,An—3 44,Q,—4 5a,Qn—5 
| “= 
Mala > Gn—1 —3ay_3 An—-1 ee Qn—1 —Sayn_s5 an—1 64y 6 
ie ay 4n—2 Qn 2An—3 an 3Qn—4 Qn 4an_ 5 
| 
a —4Aa a 3a Qn—2 —4an_5 (eiag) SS 
Baa ia he LEGA) AEs. Pa CA ERY 17) a te uy 6A, Ay — a +7, Ay 
Seem! 24,—3| |@n—1 — An-3 Mana An—1 Daa es antl An, 3An—5 . 
| 
IQ, 24. =0d,—5| a5 9 40,8 @, 4° 34, Qn—1 —74,—7| |Qn-3 —4@n— G,—1 —84,— 
40,Qn—4 n—1 3 n—§J n—2 > n—5 +64,An—6 n—3 n 5) n—1 5 n—T n—3 a8 ugh 1 Ae. 8 
\Gn Q,—4| |4n—1 ay C= 4 ay Gn—6| |4n—2 n—§ n n—T 
lay _1 —6@,—6| |@,-2 —5@n_ Qn—3 —4Qn—6| , |@r—1 —84n—g| |Qn—4 —3@n—6| , |Qn—2 —74n— 
Sad, _'s n—1 n—6 n—2 n 6474,a,—7 n—3 n 6) 4 n—1 6 n—8§ n—4 n—6 n—2 5 n 31+-9q aa igs 
\Qn Ady 5 Qn-1 Gan—5 n—2 2an_—5 ay, An—7| |A&n—3 Gn—5 an—1 an—7 
(33 
where D>, is symmetrical and of order n and equal to A,,. Values of D>, for n = 3, 4, 5 and 6 are 
jas 818) |) Yentob | Water, aYehyors n= 4: 4403; 2a4ay 3a4a, | Aaa 
ae awe Sa\e la; —4ao 
a Ao} | 2a,a,‘|\* 1 3a3a 
2 2 0} | 442 ; 3%0 
430 a; ay | 2232 M4 ay lay 2ay 
oe sae a, —4a _ 
(3a3a9 247d aa 3agay | : Of eae <0 2ara ‘ql 
a4 ay) |a3 a ; 
4444) 33a 2a7a a, 
asa4;  2a5a3 3a5az 4asa,; Sasa 46a aga, 3a6a3 4agaz Saga, : 6a¢a 
; ‘ 

a, —3a la, —4a ‘la, —Sa as; —3a a; —4a a; —Sa : |as —6dp| : 
2asa, \“4 2 | 4 5 1 bias 5 |! Aayao 2aga, |7 3 5 : 2 5 5 1 : (5 Z | | Sasa 
as a3 \a5 ay 1 |@s5 ay a6 a4! a6 a3 A ay | 146 ay] } 

a, —4a,| |a,; —3a, ila; —4ao as —4a,| lag —3a ‘la, —4a a4 —5do| | 
Sarat +5asd | 3a3a, 3aga 2h Saca, whet 14 6aca 4 9)! daa 
2 las 2a, lag ay > goa ass o> gunna las 43 1 4s 2az|* nO | as” Sage 
a, —Sao a; —4ay a, —3a as —Sa;| |a, —4a a ‘la; —4ap| | 
4asa - | 2ara 4aca 1 6a-a 3 1 5a<Ap ! 3 0 34,0 
wT las 3a, (a4 2a, 123 Gil a ade "7 |as 3a2| las 2a ee. a4 age a, 2ay| 9 
Sasdq 4444 3a3a9 2ar% Saga, |75 7 8% Barat a AO 2 aa ' Qaya 

a6 4a, as 3a, a4 1 a3 ay : 2 
6a6a Sasa 4a4a 3434 24nd a; 


For each of the values of n the determinant given is the 
highest-order determinant in sequence (30); the minors indicated 
by the dotted lines are the remaining determinantal members of 
sequence (30) in each case. 

| Consider now the determinant a,A,,_,. This is of order 
(2n—1) and is reduced to the following symmetrical determinant, 
(a,D5,—1, Of order n, by applying the appropriate condensation 
procedure from Reference 9 and simplifying the result: 
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As before, each determinant given above is the highest-order 
determinant in sequence (26) and the minors indicated by the 
dotted lines are in each case the remaining determinantal 
members of sequence (26). 

Now the determinants D5, and a,D>,_, can be shown to be 
simply related. 


GD, 4 = 
3 
na, @ — 1)a,a,_ (1 — 2)anQ,_2 (n — 3)a,an—3 (n — 4)a,a,,_4 : 
-1)a,a,—1 Qn —1 20, an—1 Sans Qn—1 4a,_4 Gaeat San_5 
a, (n—1)a,_; a, “—2)a, > An (N—3)a,_3 la, (n—4)ay_4 
as& 3a a 2a, a 3 4 
_-2)a,a n—1 n—3| |4n—2 n—3}__4. n—2 an-4 an=2 (Of ae 
| nn—2 GA, — Oe 2 —6a,a 
a, (4—2)a,_ >| |Qn_, (2—2)a,_> Bit idyaqilt=3)ages OS) An—1 (N—4)ay_4 ea § 
-3)4,An—3 cr Aan —4 @n—2 34n—4 —5a,a 5 eas 2m — 4) An—1 Ay 6] |Gn—3 Vy 25 = Gn—-1 ea) 
p82 3| (any 3) a)” * [an_p (A—3)ay_9| | [ty —4Qn_5| |@n—2 (1-4) Oy _4| In Sy 
-4)a,a,_4 Qn—1 5an—5 Qn —2 Na §| 64a 6 an 3 3an_5 is an—1 Tan 7 QAn—4 2An—5 Eis Qan—-1 84n—s|_ 4, 
a, (N—A)ay_4] |Qn_1 (1—4)a,_4 IQn—2 (—4)ay_4| "1a, —Say—6| Gp 3 (2—4)ay_4| | |@, —6a,—2| 
lie 
Values of a,,D>,,_, for n = 3, 4 and 5 are For n = 3, 
a3 
p= 3 a,Ds5 = (3)d. (35) 
3a2 : 2a,a a3a Form = 4, 
3 322 34 a 
an asDy = (“*)Ds (36) 
2a,0, | 2a;| } |az 3a a 
2 2a|}|a3 ay and, in general, 
SS a, 
ana, (2 340| |@1 240 A,Don—1 = (3) Pm . (37) 
3 a3; ay a, a ; ; : : 
A proof of this relationship for n = 3 is as follows. In the 
n= 4: determinant a,,D>,_, for n = 3, take 
4a | 3a,a OanG ane 2nd column — (a,/a3) < 1st column, 
4 | 24403 442 4a} 
Baers 2! i 3rd column — (a;/a3) X 1st column 
3.4403 a3 2ay a3 3a, a3 4a and take out a factor a3/a) from the Ist column. On inter- 
a4 3a3| | a4 2a, a4, ay changing columns 1 and 2 and then columns 2 and 3, and 
fe ad BCT = multiplying columns 1 and 2 by —1 we have the value of 
2a4ay by Ay 42 ! —Aajay | 92 340 (a,/49)D2, for n = 3. 
ae Bi This simple relationship between the two highest-order deter- 
a3 4a Pages apy: minants in each sequence means that it is sufficient to consider 
a4 a, a nM “4 bs only one such determinant, and all the determinants in both 
ate Zed sequences are either equal or simply related to the minors of 
a) this one determinant. Since the determinants D,, are seen to 
be rather simpler than the determinants a,D ,,_ 1 it is convenient 
Sag i 4asaq 3a5a3 2a5az asa, to take the D,, as basic and to refer the others to this. Thus, 
a i i for example, if D is the value of D,, when n = 3, sequence (30) 
4asag a4 2a; a4 3a, a4 4a, a4 5a can be written 
as 4a, i as 343 ; as 2a as ay Y D337, D33, D e fs . . (38) 
3asa, Q4 3a2| \a3 2az|__ dak i|a3 3a, ey ieee A 4ao and sequence (26) can be written 
as 3a3| \a4 343 a4 2a, SN pete a a3 a3 
ween nen nnn nn nnn nn nn nnn nnn nnn enna Fp sz op? pat . . . Ow (39) 
ag 4a;| |a3 3a a, 2a a, 3a 0 0 0 
2a 4 FQy) 143 Ia, 2 444) 2 240 : 
572 lq. 2a,| lay 756% a; 2a, 4a4ag 3 || Where D3, D391 are minors,* not cofactors. 
| reer * The notation used here is that the first and third suffixes indicate rows and the 
a4 Sao a3 4a az 3a ay 2ag second and fourth indicate columns. With this notation the result corresponding to 
asa a a a a sequence (39) for the case n = 4 is @4D433221/@0, @4D4332/40, 44D43/a0, 44D/ao, where 
5 1 4 1 a3 ay (42 @y D is now the value of D2, when n = 4. Similarly for higher-order cases. 


n—24n—6+*° 
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Writing A for the last term [=(a,/ao)D ,| in sequence (26), it 
is seen that the sign of A has a great significance on the possible 
character of the zeros of the polynomial. Since the first term 
of seqnence (26) (= na?) must always be positive, A> 0 indicates 
0, 2, 4, 6, . . . sign changes in sequence (26), i.e. 


A > 0 indicates 0, 4, 8, 12, . . . complex zeros in the polynomial 
(40) 


and A < 0 indicates 1, 3, 5, 7, . . . sign changes in sequence (26), 
ie: 


A < O indicates 2, 6, 10, 14, . . . complex zeros in the polynomial 
(41) 


A might therefore be described as the discriminant of the 
polynomial, and it is in fact proportional to the usual form of the 
discriminant in the cases of the quadratic, cubic and quartic. 
In the case of the cubic, for example, A is equal to —108 times 
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the usual discriminant. The above results therefore provide aj 
generalization of the idea of the discriminant of a polynomial) 
for the quintic and higher orders. 

The following observations can be drawn from statements (40)| 
and (41). For the quadratic and cubic the sign of the discrimi-} 
nant is always sufficient to determine exactly how many zeros) 
are complex; in each case A < 0 indicates two complex zeros} 
and A> 0 indicates that all zeros are real. For the quartic] 
and quintic, if A < 0 it is seen that there are exactly two complex) 
zeros, whereas if A > 0 there is ambiguity in that there may be 
either 0 or 4 complex zeros in each case. In the latter case the} 
signs of the appropriate minors must also be determined in order} 
to overcome this ambiguity. For polynomials of the 6th degree} 
and higher the sign of the discriminant alone is never sufficient} 


quartic correspond to well-known results (see, for example, / 
References 1 and 2) in the theory of equations. 


DISCUSSION ON 
‘THE STABILITY CRITERIA FOR LINEAR SYSTEMS’* 


Mr. A. T. Fuller (communicated): It would be interesting to 
know if Dr. Cutteridge was led to his condensed determinants 
from a consideration of Bezoutian determinants, which occur 
in the theory of elimination.A 

I have recently come across some German work on the 
stability criteria. In 1953 L. Cremer® showed that these may 
be reduced to the requirement that (if a, > 0) 


Aa aN Aas 


Qo, 43, 42, -.- 


(A) 
(B) 


should all be positive. Comparison with the results given by 
Dr. Cutteridge and by myself© shows that some of the coeffi- 
cients in sequence (B) are redundant. Incidentally, L. Cremer 
also proved that sequence (A) may be replaced by 


AS DE Nae ae (©) 


In 1957 EffertzP obtained condensed determinants similar to 
those of Dr. Cutteridge. However, Effertz’s proof depends on 
a paper" which has apparently not yet been published. Effertz’s 
criteria include redundant members, in that they require all 
sequence (B) to be positive. Effertz has also treatedF’S the 
network proof of Biickner’s canonical form.#:! 

Mr. J. W. Head (communicated): While Dr. Cutteridge is to 
be congratulated on the neat way in which he has welded 
together various alternative stability criteria and shown the 
connection between them, I would like to suggest that far the 
most important of these criteria is that the zeros of the even 
and odd parts of a Hurwitz polynomial are simple, purely 
imaginary and separate one another, and that this criterion is 
much more directly useful than is generally supposed. 

There are many practical cases in which we have to deal with 


and Baer 


ees O. P. D.: Monograph No. 328M February, 1959 (see 106 C, 
p. S 


a polynomial f(p) of degree not exceeding 7, so that the 
polynomial 


fi(p) = p™ + agp® + asp> + agp* + a3p? + arp? + ayp + a ] | 


= p(p* + B,)(p? + Bp? + B3) 
+ ag(p* + x1)(p? + x2)(p? + x3) 


will provide a good illustration. If f>(p) is a Hurwitz poly- 
nomial, all the «’s and f’s will be real and positive and will 
separate each other. If we are given the a’s and have to deter- 
mine the «’s and f’s and test for stability, we have only to 
determine whether two cubics in p? have all their roots real. 
Only if the two cubics both have all their roots real do we have 
to consider the question whether they separate each other. 

It is relatively easy to find the real roots of an algebraic / 
equation (especially when they are not close together) either 
graphically or by some successive approximation process such 
as that due to Lin. If the equation under consideration is a 
cubic it must have at least one real root, and when this is found 
and the corresponding factor is divided out, only a quadratic 
remains, so that it becomes immediately obvious whether the 
remaining pair of roots is real or complex. 

If the coefficients of the characteristic polynomial [such as 
f;(p) above] contain a variable parameter, the zeros of the even 
and odd parts can be obtained for a sufficient number of values 
of that parameter, and if these zeros are plotted against the 
variable parameter, the way in which stability breaks down at 
critical values of the parameter becomes clear. 

Dr. O. P. D. Cutteridge (in reply): I believe I was led to the 
condensed determinants after studying various condensation 
techniques given in a number of textbooks. The relationship of 
the condensed determinants to the Bezoutians is the same as that 
of the Hurwitz determinants to the bigradients! of Sylvester’s 
dialytic method of elimination. 


| I would agree with Mr. Fuller’s comments on the German 
work that he quotes. Incidentally, an account in English of 
jome of this and related material is to be found in a section con- 
\ributed by Effertz to the recent translation of Cauer’s treatise.* 
‘The paper by Cremer and Effertz’ was published in March, 
||959, and contains a total of 38 references; the method of treat- 
ment differs completely from that adopted in my paper. 

It is interesting to note that a canonical form for a Hurwitz 


degree case, the expression 


b 
1+ap+— 
Pp 


will be a Hurwitz polynomial for all positive values of the 
coefficients a, a>, a3, b;, b>, b3; and conversely any sixth-degree 
Hurwitz polynomial can be expressed in this form. The third- 
order determinant above compares with the sixth-order deter- 
minant for the corresponding case in Biickner’s form. 

The question of alternative formulations of the sets of stability 
criteria, without the introduction of redundant conditions, is of 
some practical importance and was briefly considered at the end 

of Appendix 7.3 of the paper. Some further variations will be 
given in a forthcoming paper.M 

Incidentally the basic result that either the Hurwitz deter- 
minants of even order or those of odd order can be replaced by 
either of two non-redundant sets of the coefficients was given in 
a much neglected paper by Liénard and ChipartN in 1914 by 
means of an analysis based on certain quadratic forms associated 
with the polynomial. These authors were apparently unaware 
of Hurwitz’s paper° but compared their results with those of 
Routh,’ and with related results of Hermite. The last chapter 
of a Russian textbook ‘Theory of Matrices’ by Gantmacher® is 
devoted to stability and related problems, and contains a much 
more explicit account’ of the results of Liénard and Chipart 
than was given by the original authors, together with an alterna- 
tive proof and some extensions to the basic result. At the 
moment of writing apparently only the first part of this textbook 
is available in German translation, but there is an English trans- 
lation of the relevant second part.Y The paper by Liénard and 
Chipart, which unfortunately has only just come to my notice, 
also gave an example of a condensed determinant, similar to 
those of my paper, obtained by forming the eliminant of two 
polynomials by Bezout’s method. 

‘I am not sure I agree with Mr. Head in maintaining the pre- 
eminence of any one form of the stability criterion, from the 
standpoint of practical application, in view of the varied nature 
of the problems that can arise from practice. 

From the theoretical point of view, however, as the fact that 
the zeros of the odd and even parts of a Hurwitz polynomial 
must be simple, purely imaginary and must separate one another 
is easily obtained, this makes it a very suitable starting-point 
from which to derive other forms of stability criteria. This was 
the method adopted by the author and also by Routh,Y as 
mentioned in the paper. Routh evidently regarded this form 
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as unsuitable for most applications and proceeded to develop his 
two well-known rules, 

As a general principle it seems preferable to investigate the 
character of the zeros of a polynomial indirectly rather than 
calculate them all approximately and then examine the results 
obtained. Although the polynomials whose zeros have to be 
determined in Mr. Head’s method are only about half the order 
of the original polynomial, I doubt whether, generally speaking, 
his method would result in the expenditure of less labour than 
others. 
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SOME RESULTS ON THE CROSS-CAPACITANCES PER UNIT LENGTH OF | 
CYLINDRICAL THREE-TERMINAL CAPACITORS WITH THIN DIELECTRIC FILMS | 
ON THEIR ELECTRODES | 


By D. G. LAMPARD, M.Sc., Ph.D., and R. D. CUTKOSKY, B.Sc. 


(The paper was first received 1st April, in revised form 17th July, and in final form 18th August, 1959, It was published as an INSTITUTION — | 
MOoNoGRAPH in January, 1960.) 


SUMMARY 


The effect on the cross-capacitances per unit length of cylindrical 
3-terminal capacitors of thin dielectric films on the capacitor electrodes 
is discussed. 

It is assumed that the cross-section of such dielectric films remains 
constant throughout the length of the capacitor. Some conformal 
transformations of basic cylindrical capacitor cross-sections are given. 
When these are applied to a symmetric cylindrical capacitor with a 
thin dielectric film on its electrodes, the results suggest that the mean 
capacitance per unit length may remain constant to the first order 
despite the presence of the dielectric film. The same methods also 
suggest that the individual cross-capacitances per unit length may 
remain constant to the first order provided that the dielectric film is 
disposed symmetrically with respect to the capacitor symmetry plane. 

Further support for these conjectures is given by the results of a 
detailed calculation of the cross-capacitances per unit length of a 
parallel-plate cylindrical capacitor with a thin uniform dielectric film 
on one electrode. In the last Section, a lemma concerning the existence 
of an equivalent dielectricless capacitor is given and this is followed by 
the proof of general results of the type suggested by the previous 
working. 


(1) INTRODUCTION 


A new approach to the design of caiculable 3-terminal capaci- 
tance standards has recently been made possible as a consequence 
of a general theorem! concerning cross-capacitances of certain 
cylindrical electrode systems. This theorem enables a capacitor 
to be constructed such that only one precision length measure- 
ment is necessary to compute its capacitance to a high order of 
accuracy. 

A proof of this general theorem, together with detailed 
calculations of cross-capacitances of cylindrical systems with a 
variety of cross-sections, has been given in a recent paper.” In 
that paper the effects of various perturbations which could occur 
in practice were considered, but an important practical perturba- 
tion, namely the presence of spurious thin dielectric films on the 
capacitor electrodes, was dealt with only in one special case. 

Such dielectric films can be present, for example, in a precision 
calculable cylindrical capacitor, where it may be difficult to avoid 
surface contamination of the electrodes with oil films, adsorbed 
vapours, etc. They can also arise in connection with the direct 
experimental determination (by capacitance measurement of a 
cylindrical capacitor) of the permittivity of vapours, where a 
thin film of condensed vapour on the capacitor electrodes may 
constitute a serious source of error. 

In the paper we examine further the effects of thin dielectric 
films in cylindrical capacitors, the assumption being made that 
the cross-section of the dielectric film remains constant through- 
out the length of the capacitor. To facilitate the discussion of 
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of arbitrary cross-section_can be mapped into a cylindrical} 
parallel-plate capacitor. This procedure provides a simple} 
alternative proof of the main theorem on cylindrical capacitors} 
in a more generalized form. 

In Section 3 it is shown how some uniform dielectric distribu-} 
tions behave under the transformations of Section 2, while in} 


Section 4 both cross-capacitances per unit length of the cylindrical | 


dielectric distribution on its electrodes, are calculated in detail.} 
The calculations of Section 3 and 4 suggest some general results, | 
which are established in Section 5, concerning the effect of thin} 
dielectric films. 


(2) GENERALIZATION OF THE THEOREM ON 
CYLINDRICAL CAPACITORS 

Consider a conducting cylindrical shell whose right cross-_ 
section, Fig. l(a), is the arbitrary closed curve S. Let this shell. 
be divided into four parts by infinitesimal gaps at «, B, y and 6. 

By means of an existence theorem due to Riemann, in the 
theory of conformal transformations, the interior of the cross- 
section S can be mapped into the interior of a circle, Fig. 1(d), 
and with a suitable choice of a co-ordinate system the point « 
can be mapped on to the origin and the centre of the circle taken 
on the real axis. 

The mapping w = 1/z transforms the interior of the circle of 
Fig. 1(6) onto the right-hand half-plane of Fig. 1(c), where the 
distances By and yd have been arbitrarily called b and a, 
respectively. 

By means of simple rotations and translations, the right half- 
plane of Fig. 1(c) can be mapped into the upper half-plane of 
either Fig. 1(d)(i) or (ii). 

Finally, the mapping v = log: uw transforms the upper half, 
planes of Fig. 1(d) into the strip spaces of Fig. 1(e). 

The configuration of Fig. 1(e)(i) is that of the usual guarded 
parallel-plate capacitor. The capacitance per unit length can be 
computed by assuming electrode «f to be at unit potential and 
all other electrodes at zero potential. In this case, all lines of 
force are parallel to the imaginary axis and the capacitance per 
unit length, C,, between electrodes «6 and 6 is simply 


ie log? (G6) — lorem 
Ci es = o |. Ph 
1 a 
aw log. (1 a 9) POT cee 


Similarly, the capacitance per unit length, C,, between elec- 
trodes «6 and fy is,* from Fig. 1(e)(ii), 


C2 = ga los, (1 +2) . = ae 


* Note that eqns. (2) and (3) agree with eqn. (75) of Reference 2. 
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f Fig. 1.—Transformed cross-sections of general cylindrical capacitor. 


(a) Cross-section. 


(6) Transformation into z-plane. 
(c) Transformation into w-plane. 
(d) Transformation into u-plane. 


(i) Orientation 1. 


(e) Transformation into y-plane. 


(i) Orientation 1. 


If we introduce a parameter tT = a/b we see that the two 
ross-capacitances per unit length for any cylindrical capacitor 
an always be written in the form 


1 
Ci(r) = gy log. (1 +7) 4) 
1 oa 
C7) = Aq2 1o8e @ te -) Sut See (5) 
Eliminating 7 between these two equations, 
e74wecy ab e—4C2 = 1 (6) 


This identity was given as eqns. (32) and (71) of Reference 2 
n connection with cylindrical capacitors of square and circular 
ross-section, respectively, but its complete generality was not 
1oticed until some time later (Lampard*+ and more recently 
an der Pauw?). 

It follows immediately from eqn. (6) that, if C; = C), 

1 
Cy = C, = 75 log, 2 @) 

It is clear that a sufficient, but not necessary, condition for 
he equality of C, and C, is the existence of an axis of symmetry 
of the capacitor cross-section, which passes through a pair of 
1on-adjacent electrode gaps. This was the form in which the 
heorem was originally stated. It will be clear that the cross- 
ection of any cylindrical capacitor whose cross-capacitances per 
mit length are equal can always be mapped into a cross-section 
1aving this symmetry property, and hence, in the paper, any such 
ylindrical capacitor will be referred to as electrically symmetric. 
[he non-necessity of the physical symmetry condition has been 
0inted out by Thompson.°® 

On differentiating eqn. (6), 


ent Cid, — é> C2dC) 


(8) 
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(6) 
us) 
LOG, b-+| 
BN }x—LOG, (a+ b) 
(ii) Orientation 2. 
Gi) Orientation 2. 
and, in particular, when C, = C), 
dC, eS dC, (9) 


We shall make use of this result in a later Section to prove 
some general results concerning the effects of thin dielectric 
films. 


(3) CONFORMAL MAPPING OF DIELECTRIC 
DISTRIBUTIONS 
The way in which two simple uniform dielectric distributions 
behave under the conformal transformations of Section 2 will 
be examined. Thus consider the physically symmetrical system 


of Fig. 2(a). This system was analysed in Reference 2, where 
PERMITTIVITY € 
{ 
ic 
} 
ee 


(6) 


Fig. 2.—Infinite-plane cylindrical capacitor with dielectric film. 


(a) Cross-section, 
(6) Transformation into z-plane. 
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t= 


PERMITTIVITY € 


(Cc) 


Fig. 3.—Parallel-plate cylindrical capacitor with dielectric film. 


(a) Cross-section. _ 
(6) Transformation into w-plane. 
(c) Transformation into z-plane. 


it was shown that the cross-capacitance per unit length between 
electrodes «8 and y6 is given, when the dielectric thickness is 
small, by 


gs) ions Ine 
Cries 108. 273 ele (1 2) 


2 
+ terms in () and higher (10) 


If we transform this system [which is to be compared with 
that of Fig. 1(d)(ii)] back into a circular cross-section by the 
methods of the previous Section, we obtain the system of 
Fig. 2(b), where the dielectric is still symmetrically distributed 
with respect to the capacitor symmetry plane but is no longer 
of constant thickness. 

From eqn. (10) it can be seen that for this system each cross- 
capacitance per unit length is increased by an amount pro- 
portional to the square of the dielectric thickness when this 
thickness is small. : 

Now consider the electrically symmetrical system of Fig. 3(a), 
which may be compared with Fig. 1(e)(i) (with a = 6). Trans- 
forming the system back by the methods of Section 2, we obtain 
the physically symmetrical systems of Fig. 3(6) and (c), respec- 
tively. It will be seen that the dielectric in the transformed 
systems is neither symmetrically disposed with respect to the 
capacitor symmetry plane nor of constant thickness. 

A simple calculation on the original system of Fig. 3(a) shows 
that, in this case, the cross-capacitance per unit length between 


a and v6 is ' 
c= (daton.2)[1 +241 442) (+... ] 
‘ (11) 


ie. the cross-capacitance per unit length between «8 and 6 i 
increased by an amount directly proportional to the dielectri 
thickness when the thickness is small. 

These results suggest that, for small dielectric thickness, thi 
cross-capacitance per unit length of a symmetric cylindric 
capacitor may be written as a Taylor series in terms of th 
dielectric thickness, in which the coefficient of the linear term i 
zero, provided that the dielectric is disposed symmetrically wit 
respect to the capacitor symmetry plane. That this is so wil 
be demonstrated in Section 5. 

Up to this point we have been concerned only with th 
individual cross-capacitances per unit length. It would be o 
considerable interest to know what happens to the mean capaci 
tance per unit length of a symmetrical cylindrical capacitor whe 
its electrodes have a thin dielectric film distributed on then 
asymmetrically. Thus in the system of Fig. 3 it may be con 
jectured that the cross-capacitance per unit length between ox 
and fy will be reduced by the same amount, to the first order 
as the increase in the cross-capacitance per unit length betwee 
«8 and yé. As will be seen in the next Section, such a result i 
certainly true for the system discussed there (a slightly simplifies 
version of the system of Fig. 3), and in Section 5 it is shown tha 
this result is true in general. 


(4) THE PARALLEL-PLATE CYLINDRICAL CAPACITOR WITE 
A UNIFORM DIELECTRIC FILM ON ONE ELECTRODE 
By the methods of Section 2 any cylindrical capacitor can bi 

transformed into an equivalent parallel-plate cylindrical capaci 

tor. When the original capacitor cross-section may be trans 
formed into one having symmetry about a line passing throug! 

a pair of non-adjacent electrode gaps, the dimensions of thi 

equivalent parallel-plate capacitor have been shown to be suc! 


CYLINDRICAL THREE-TERMINAL CAPACITORS WITH THIN DIELECTRIC FILMS ON THEIR ELECTRODES 


fat each cross-capacitance per unit length is given by C = 
471?) log. 2. 

In this Section we analyse the general parallel-plate cylindrical 
pacitor with the additional complication of a dielectric film 
constant thickness on one electrode. The capacitor cross- 

ction, together with the co-ordinate system that will be used in 

e analysis, is shown in Fig. 4. 
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Fig. 4.—Cross-section of parallel-plate cylindrical capacitor with 
dielectric film on one electrode only. 


The working of this Section will be given in some detail, 
nce the system of Fig. 4 appears to be one of the few general 
ymmetric cases in which both cross-capacitances per unit 
agth can be fairly readily evaluated analytically. 

1.1) Cross-Capacitance per Unit Length from a8 to y5 = C,(€) 


Electrode «8 may be placed at unit potential, all other elec- 
odes at zero. The lines of force are parallel to the y-axis 
id the dielectric boundary is an equipotential. Thus the 
quired capacitance per unit length may be regarded as arising 
om two capacitors in series: 
=1 =i 
ye) 
t) Aart 


he a 
4r(b — 
id hence 
t (e — 1) |-! 
Ci(e) = Z alk ab : : ? (13) 


= [17S +] oo 


In particular, for the symmetrical cylindrical capacitor, 


a@ log: 2 
ba : 


[Cy] (12) 


aan 


(15) 


id substituting eqn. (15) in eqn. (14) and denoting the particular 


ue of C, 4 cr, 
Oe 72 low. 2 2)/ 1 an eee ey by om | 
(16) 


.2) Cross-Capacitance per Unit Length from «6 to By = C,(€) 


The calculations here are considerably more difficult. It is 
mvenient to take electrode «6 at unit potential and all other 
ectrodes at zero. In region B, with the co-ordinate system 
own, the particular solution of Laplace’s equation which 
inishes when y = 0 is 


+0 
V(x, y) = J _B(Ke!* sinh kydk (17) 


id, in region A, is 


+00 
V(x, 9) = | [A\(A) cosh ky + A,(k) sinh ky]e/dk (18) 
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with the boundary conditions 
V(x, t) = Va, 1) (19) 
Wylx, y) E Vax, | 
= A 20 
bg eh ty Ae, a 


These two boundary conditions, together with Lerch’s 
theorem on the uniqueness of Fourier transforms, serve to 
determine the two functions A,(k), A,(k) in terms of B(x): 


A,(k) = 4B(A)(1 — ©) sinh 2kt 
Ak) = 4B(A)[. + ©) — (1 — €) cosh 2k¢] 
On substituting eqns. (21) and (22) in eqn. (18), 


(21) 
(22) 


+0 
Ve = i £1 — 6) sinh 2kt cosh ky 
+ [dd +6 —( —«)cosh 2kt* sinh kyB(k)e/**dk (23) 


Setting y = 5 and using the boundary condition 
Vyas x 0 
x<0 


af (24) 
we obtain, on inverting the Fourier transform of eqn. (18), 


4B(k) {(1 — ©) sinh 2kt cosh kb + [1 +6) 
— (1 —€) cosh2kt] sinh kb} 


1 


eg (25) 
~ 2rd 

= pe ne + d(k) (26) 

Dar | jk i. 

where 6(k) is the unit impulse or Dirac delta function. 
Using eqn. (26) in eqn. (23) and writing 

eee 1l+e Qn 

l-—e 


we find the complete solution for the potential function in region 
A, namely 


+o 
1 1 
Via, y) — i| E Ap 


sinh 2kt cosh ky + (m — cosh 2k?) sinh ky 
sinh 2kt cosh kb + (m — cosh 2kt) sinh kb 


nth | elkx x 


dk (28) 


Now, the charge density on the upper electrode is given by 


1 3 Vig, y) 
we 9 
pit) a) doy ie (29) 
+ 0 
1 1 
Speed elie ae 30 
af Ie Ar 70h | keJkxF Ik, b, t]dk (30) 
where 
F[k,b, ‘| = sinh 2kt sinh kb + (m — cosh 2kt) cosh kb (31) 


sinh 2kt cosh kb + (m— cosh 2kt) sinh kb 


It seems impossible to evaluate eqn. (30) in closed form, so we 
content ourselves with an expansion in powers of f, the dielectric 
thickness. This procedure is reasonable as we are primarily 
concerned with the effects of thin dielectric films. Thus, 
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expanding eqn. (31) in powers of t, and substituting the resulting 
expansion in eqn, (30), 


+ 00 
1 ue es cosh kb 
a oad ; li nth) |e! sinh Kb 
2 ie 
= Las Vie! oe a): 
— m/ 8x _Lik inh? kb 
+ 00 
yy 1 cosh kb 
2 \ 3 ojkx 
ty Gaal Ya [i nth) |e a 
+ terms in f? and higher . (32) 


The three integrals occurring here are readily evaluated, for 
negative x, by contour integration. Results are given in 
Appendix 8.1. On using these results in eqn. (32) we have, 


finally, 
TX xxib 
emx/b t(e—1)} er? b 
a ch d—e/) "be 1 — emlb * (1 — erx/by2 
al 7X 
_ otx/b 
reno ia 
a) ( € ) (eee (ee emx]b)2 


er 4 erxlbygrx/b 


a ad <> e™x/b)3 
+ terms in (¢/b)? and higher (33) 
Here we have reintroduced e by the relation 
aisha heme ESI (34) 
1—m € 


The cross-capacitance per unit length, C,(e), is just the total 
charge on electrode By, namely 


C6 = J ocpax . 35) 


We now substitute eqn. (33) in eqn. (35), and carry out the 
integration, making use of the results given in Appendix 8.2. 


1 
C(€) ——Hieradep) log. ad — E—Ta/b) 


4r 
e7nalb 


e—|1 a 
Aq? ala) € ba b 1 — e-talb 
2 e7malb ma Ee 77alb 
ad er ga nalby2 2 en — 


(36) 


4 1 - e— ie (& 

477? G ( € 21 \ 5b 
t\3 ‘ 

-+ terms in (;) and higher 

Of particular interest is the case of the symmetrical cylindrical 


capacitor for which 


a=log, 2° b= a (37) 


For this case, denoting the ate value of C,(e) by C*, 


(ae 


+ terms in ei and higher (38) 


CHO) = Fae 18 2 jit _ 
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3 : 

We note that the linear terms in eqns. (16) and (38) are equ 

in magnitude and opposite in sign, so that the change in fl 

mean capacitance per unit length is only of the second order 
the dielectric thickness. Specifically 


Ci© + CFO 1 


0 ft\* 72 = TF eee 
2 pea lowe 21 (=) ( € ) 2 


3 
+ terms in (=) and higher G 


This example thus supports the conjecture that, in the ca 
of a thin dielectric film asymmetrically distributed on the ele 
trodes of a cylindrical capacitor, the mean capacitance per ur 
length remains constant to the first order. 

Finally let us briefly consider the results for the gener 
parallel-plate cylindrical capacitor with no dielectric fil 
Setting t = 0 in eqns. (14) and (36) we have (on dropping tl 
arguments of C, and C) 


C= . . . . . . . . . (4 


1 
Cy == an log: d = e—Talb) . ed (4 


As a partial check on the fairly complicated calculations 
this Section we note that the introduction of a parameter 


¢ aero i). ree 


enables eqns. (40) and (41) to be written in the standard par 
metric form of eqns. (4) and (5). 


(5) GENERAL RESULTS CONCERNING DIELECTRIC 
FILMS IN CYLINDRICAL CAPACITORS 

In this Section we give proofs of the following general resul 
which have been suggested by the working of Sections 3 and 

(a) In an electrically symmetric cylindrical capacitor, | 
arbitrary thin dielectric film on the capacitor electrodes produc 
only a second-order change in the mean capacitance per wu 
length. 

(6) In a physically symmetric cylindrical capacitor, a th 
dielectric film on the capacitor electrodes, which is dispos 
symmetrically with respect to the capacitor symmetry pla 
produces only a second-order change in each cross-capacitan 
per unit length. 

Before proceeding to the proof of these results we establi 
a preliminary lemma. This lemma states that, given a 
cylindrical capacitor with a thin dielectric film on its electrod 
there exists an equivalent cylindrical capacitor containing — 
dielectric, } whose electrodes lie wholly within the space oceupi 
by the dielectric film, such that the electric fields, at all poi 
in the dielectricless region of the two capacitors, are equal 
the first order. 

The significance of this lemma is that it shows, for dielect 
films whose thickness is small, that the perturbation required 
obtain the equivalent electrode system is both small, of t 
same order as the dielectric thickness, being in fact bounded 
the dielectric itself, and independent of the field configuratic 
i.e. independent of the individual electrode potentials. 

Thus consider the cross-section of an arbitrary cylindri 
capacitor and, in particular, a small region near electrode o 
say (assumed at zero potential without loss of generality), 
shown in Fig. 5. 


+ Throughout the paper the term dielectric is used in the sense of a mate 
substance. 
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Ep(s) 
ELECTRODE 


DIELECTRIC 
BOUNDARY 


E ,(s) 


Fig. 5.—Cross-section of general cylindrical capacitor with arbitrary thin dielectric film. 


In the following discussion we assume that: 


(i) A(s) is small compared with the length of the line of force 
rminating at s. 

(ii) A’(s) is such that the field is almost normal to the dielectric 
yundary. 

With these assumptions, the field F,(s) in the dielectric can 
regarded as essentially constant for a given s and, on equating 
mal components of electric displacement, 


E,(s) cos 0, = €E,(s) cos 63 (43) 
1 1 + tan? 6 
Eg(s) = ZEW 7earge (44) 
sing the identity 
« tan 0, = tan 0, (45) 
\d noting that for small A“(s) 
tan 6, = As) + o[A(s)] . (46) 
> have 
| 4 2 
Eg(s) = +E 4s) in [2or + ofA%]| 47) 
€ ea 
ah 
€ 
= FEA) + ofA%s)]} (48) 


hus for small A(s) the potential function at the dielectric 
yundary is 


1 
VG) = AWE ‘4(s) + o[A*(s)] (49) 
Now consider the same region of the capacitor but with the 
electric removed and the electrode surface at s advanced a 
stance xA(s) towards the former position of the dielectric 
yundary. The potential at the dielectric boundary is now 
mply 


V*(s) = (1 — DAWES) + of A%G)] (50) 


here asterisks have been used to denote the values of V(s) and 
A(s) in this situation. 
Clearly, if V(s) = V*(s) for all s, then so must the electric 
aids in region A also be equal; in particular we must have 
4(8) = E%(s) for all s. Equating the two expressions for V(s) 
id solving for x, 

e—1 


oe Ee + of[A(s)] (51) 
hich result is independent of s to the first order and also 
\dependent of the way in which the field arises. 

Thus we have shown that the electric field in the region 
noccupied by dielectric remains constant to the first order, if 
1e dielectric and electrode systems are replaced by a perturbed 
ectrode system (without dielectric) such that each new electrode 
ement is a distance A(s)(e — 1)/e trom the corresponding 


original electrode position. Since (« — 1)/e < 1, the perturbed 
electrode system of the equivalent dielectricless capacitor lies 
wholly within the region formerly occupied by the dielectric, 
which proves the lemma. 

Finally, the capacitance per unit length between electrodes «8 
and v6, say, can be computed by assuming that electrode yo is 
at unit potential and all other electrodes are at zero potential. 
Hence 

8 


Cue | nee 


where o(s), the charge density per unit length, may be written 
in terms of the electric displacement at the electrode surface. 
To the first order, this electric displacement is identical with the 
normal component of the electric field at the dielectric, and 
hence 


(52) 


As) 


€ 


1 2) -12 
a(s) = Led! ae | | } + o[A(s)] (53) 


1 
= gE) + ofA] (54) 

We have already seen that the electric field Ea(s) remains 
constant to the first order in the equivalent dielectricless 
capacitor, and so, on using eqn. (54) in eqn. (52), it is clear 
that the same conclusion applies to the capacitance per unit 
length, C,, and by similar working to the other cross-capaci- 
tance per unit length, C). 

As an example of the use of this lemma we may compute the 
first-order changes in the cross-capacitances per unit length for 
the system of Fig. 4 due to the presence of the dielectric film. 
Then, according to the lemma, the, equivalent dielectricless 
capacitor would have the same disposition, but the distance b 
between the electrodes must now be reduced by an amount 


t(e — 1)/e. Thus, using eqns. (40) and (41), 
C,(e) = “ i to the first order (55) 
aye 
4a ¢ —T 
€ 
1 7a 
Cale) = — Fy loge 1— exp / — 2%) 
b—t 
€ 
to the first order (56) 
On expanding eqns. (55) and (56) in powers of #(e — 1)/e, 
a te—1 
= i 57 
Ce) real a Rare + higher terms ) (57) 


whe. log, (1 — e~74/4) 


CX)" 72 
— —tal/b 
— 7 ; : . > i a + higher terms (58) 


which of course agree with eqns. (14) and (36), to the first order. 
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To prove the first general result, mentioned at the beginning 
of this Section, consider an electrically symmetric cylindrical 
capacitor with an arbitrary thin dielectric film on all its elec- 
trodes. By the lemma and its subsequent discussion it follows 
that there is an equivalent dielectricless capacitor, with the 
same cross-capacitances per unit length to the first order, but 
whose electrodes lie wholly within the region formerly occupied 
by dielectric. The change in the mean capacitance per unit 
length, Cy, is then given by 


SG, 8 (42%) (59) 


= 4(dC, + dC,) + terms of higher degree (60) 
But for the electrically symmetric cylindrical capacitor 

(C; = C,) we have, from eqn. (9), 
dC, hoa aed dC, (if (Os — 


GC) . (61) 


and so (62) 


dCy = 0 (to the first order) 


which proves the first result. 

To prove the second general result, consider a physically 
symmetric cylindrical capacitor C with a thin dielectric film on 
all its electrodes, which is disposed symmetrically with respect 
to the capacitor symmetry plane. The dielectric in such a 


Fig. 6.—Sequential placement of dielectric film in a physically symmetric cylindrical capacitor. 


capacitor may be considered to be introduced in two steps as 
shown in Fig. 6. 
When we consider the equivalent dielectricless capacitors, 


which exist by the lemma above, we have, on making use of 
eqn. (9), 


dC; = — SC; (to first order) . (63) 
dCi = — SCZ (to first order) . (64) 
Also, because of the symmetry of the dielectric film, 
8G] = 0C; (65) 
Bed see (66) 


On superposing the two sets of equations to obtain results for 
the whole capacitance, C, 


dC, = dC; + SCY = 0 (to first order) (67) 
dC, = 8C; + SCY = 0 (to first order) (68) 


which proves the second general result, namely that in this case 
the individual cross-capacitances per unit length remain constant 
to the first order. 

Alternatively we may note that, by the lemma, the equivalent 
dielectricless capacitor is also physically symmetrical, and hence 
to the first order the individual cross-capacitances per unit length 
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negative x) by summing residues at the poles k = — jnm/b 
with n = 0,1,2... The results are: 
+o 4 
1 1 cosh kb 1 ae 
ais ee kx oo /b 
| - a 73) kel ah bBo | arb, 
i] eT™/b 


Mf i 


are both (1/4777) log, 2 and thus remain constant to the first ordel 
which proves the second general result. | 


(6) CONCLUSION | 

The application of conformal-transformation methods 1) 
certain types of cylindrical 3-terminal capacitors, whose ele! 
trodes carry thin dielectric films, has suggested some gener} 
results concerning the effects on the cross-capacitances of suc|, 
dielectric films. The detailed computation of the cross-capac 
tances of a parallel-plate cylindrical capacitor (with a thir 
highly asymmetric dielectric film on its electrodes) has furthe 
substantiated these conjectures, and proofs are given in the fing 
Section. } 
It should be realized that the results of the paper apply on| 


throughout the length-of the capacitor. More senctal 
dielectric films will be distributed in a 3-dimensional patter} 
over the electrode surfaces. In this case the potential probler} 
is correspondingly 3-dimensional and hence the powerft 
methods of conformal transformation are no longer availabl) 4 
for its solution. Such problems require further study. 
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(8) APPENDIX 


(8.1) Values of Integrals occurring in Eqn. (32) | 
The integrals occurring in eqn. (32) may be evaluated (for 


7 ib Ta eee 


bag 1 

2ojkx 
: a is a em aa 
| © 


2 md, jee = Tm) err . 7) 
TX 
meee 


(72) 


Amb*| eri? (1 — erlby? 


j +0 
| ; 1 37k COS kb 


a 1 7X 7X ab 
> - sw 2, 31 +4 +4(%) +22 2) 
TX 
1 > erx/b 4 apres 
ee Gere) 1 — ese a — ey 


He | TX 2 
xb) Tx /b 
( 5 (1 + 67>) ¢ 


+ (74) 


ad tes grx/b)3 


CYLINDRICAL THREE-TERMINAL CAPACITORS WITH THIN DIELECTRIC FILMS ON THEIR ELECTRODES 


119 


(8.2) Values of Integrals occurring in Eqn. (35) 


When carrying out the integration of eqn. (35), with o(x) given 
by eqn. (33), the following results are needed: 


gmx/b b if 
ss Tt, 
ip oT hats a 7, l0Be cies rear Ta Eee ECTS) 
—0 
=a 
TX amx|b 74 | ralb 
|e — tog, ee 
ae eee eal ees eee | Te) 
—o 
=o 
eae a8 gTx/b) erex]b 
(d — gm/by3 dx 
—0 
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EXPLICIT FORM OF F.M. DISTORTION PRODUCTS WITH WHITE-NOISE 
MODULATION 


By R. G. MEDHURST, B.Sc. 


(The paper was first received 27th July, and in revised form 10th September, 1959, It was published as an INSTITUTION MONOGRAPH 
in January, 1960.) 


SUMMARY 


When a frequency-modulated wave passes through a network whose 
phase or amplitude characteristics vary non-linearly with frequency, 
distortion terms appear as both frequency and amplitude modulation 
of the output wave. If the characteristics are expressed as power 
series, these distortion terms appear, to first order, as products of 
powers of time derivatives of the unwanted frequency modulation. 
When the frequency modulation may be simulated by a band of 
random noise (as in multiplex telephony carrying large numbers of 
channels), the spectra of the distortion products can, in principle, be 
described by simple algebraic functions of the characteristics (i.e. the 
minimum and maximum frequencies and the r.m.s. frequency devia- 
tion) of the modulating noise band. 

Except in certain special cases, the derivation of these algebraic 
formulae by straightforward analytical methods becomes prohibitively 
tedious for distortions of order much above the second. However, 
once the formulae are found, the insertion of numerical values for 
particular cases is straightforward. In the present paper it is shown 
how the problem can be reduced to the repetition of a number of 
standard operations which can be carried out using a digital computer. 
The technique is illustrated by application to fourth-order distortion 
appearing in the amplitude modulation, generated by terms in the 
amplitude characteristic up to sixth degree. Even in such an 
apparently simple case as this it appears from the literature that 
the closed form of the distortion formula has not hitherto been 
obtained. This example is of direct practical interest since, for 
example, the amplitude characteristic of a maximally-flat-amplitude 
triple-tuned circuit is of sixth degree in the region around the mid- 
band frequency. With a minor modification, the resulting formula 
also applies to fourth-order distortion appearing in the frequency 
modulation, owing to terms in the phase characteristic up to sixth 
degree. 

Use is made of a discontinuous contour integral applied to a 
similar, but somewhat simpler, case by Bennett;! a closely analogous 
course can be followed using the more recently developed theory of 
generalized functions,? but in this particular problem the contour- 
integral method is more economical. 

Formulae for the various orders of distortion in the top channel 
due to amplifier and discriminator characteristics are given in tabular 
form. 


LIST OF SYMBOLS 


@, = Carrier frequency, rad/sec. 
@, = Undistorted frequency modulation, rad/sec. 
wa, = R.MS. frequency deviation, rad/sec. 
4; = Phase modulation, rad. 
Om ®m = Maximum and minimum modulating frequency, 
rad/sec. 
@,, = A representative frequency of the noise-band modula- 
tion, rad/sec. 
dom = A random phase angle associated with the tone of 
frequency w,,, rad. 
w = Difference between a general angular frequency and 
W,, rad/sec. 
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a,, b, = Respective coefficients of nth-order terms in expat) 
sions of amplifier phase and amplitude charad 
teristics, in powers of w/w,. 

a, 6, = Respective coefficients of nth-order terms in expa 
sions of discriminator phase and amplitude chara¢ 
teristics, in powers of w/a,. 1 
Bo, B, = Respective coefficients of zero and first-order term} 
in expansion of discriminator amplitude charac 
teristic, in powers of w/w,. 


, 


Cc, = oe =f Braye 
A= Sut + 1504 
be 
wra/2 
a = ; 
V (Bin QD) 


(1) INTRODUCTION 


A problem which constantly recurs in the design and produc 
tion of f.m. trunk radio systems carrying many f.d.m. speec! 
channels is the minimization of intermodulation distortion 
Distortion of this kind may be generated by echoes in th 
radio transmission path or in the feeders, by interference o 
other carriers, by non-linear modulator characteristics or by 
non-linearities in the amplitude and phase characteristics o 
amplifiers and discriminators. The first two distortion mech 
anisms may now be considered to be reasonably well under 
stood,*-® but the problem of the effect of non-linear transmissior 
characteristics has so far been made amenable to analysis onh 
when the modulation takes the form of a single tone,?-!! or fo: 
special forms of transmission characteristic, !2-!4 

When the amplitude and phase characteristics of amplifier 
or discriminators are expressed as power series in terms of thi 
frequency deviation it can be shown? !> that distortion term: 
appearing both as frequency modulation and amplitude modula 
tion of the output wave appear as products of powers of tim 
derivatives of the frequency modulation; this result holds if thi 
departures from linearity of the characteristics are not to¢ 
great.!3_ Thus one has to deal with expressions of the form 


ema [ew] Eee al 
dt! dt™ dia - a 
When large numbers of speech channels are involved, thi 
f.d.m. signal (in which the speech channels occupy 4kc/s band 
spaced in frequency one above another) can be satisfactoril: 
simulated by a band of white noise.!®17_ Thus, if this signal i 


used directly to frequency-modulate a carrier, the frequenc 
modulation can be written in the form 


A 


Om 
wOy=a Dy COS (Want + Dis) « +2 
O=Om 
where w,, increases in unit steps. 
If the right-hand side of eqn. (2) is inserted into expression (1) 
there can be extracted, when the total order of the term is odd 
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portion coherent with jx, (though possibly not with the same 
ectral distribution). This will not produce distortion in a 
ultiplex-telephony system, although it may distort a television 
mal. The remainder will consist of higher-order portions, 
ther coherent or incoherent with terms of lower order. Both 
pes of term contribute distortion, which appears as unintel- 
sible crosstalk, and they have to be considered separately, since 
ey add in different ways to the appropriate lower-order terms. 
| One is interested in practice in the magnitude of the distortion 
pearing in each channel, relative to the wanted speech power. 
onsidering the channels as occupying bands which are very 
arrow compared with the total base band, it will be sufficient 
| evaluate the spectra of the distortion terms and to compare 
le power at particular frequencies with the wanted signal power 
| the same frequencies. 

‘Evaluation of spectra via the auto-correlation function and 
ie Wiener—Khintchine relation has been carried through with 
urying success in a range of problems concerning modulation 
ith noise and interference by noise (e.g. calculation of echo 
istortion,*® of the r.f. spectrum of waves frequency-modulated 
y noise ®: 18. 19 and of the a.m. and f.m. disturbance of a carrier 
y added noise**). The application of this method to the 
‘(pansion in spectral form of sums of terms such as expres- 
on (1), while straightforward in principle, results in analytical 
ymplexity which becomes very formidable as the order of 
rms increases. 

When wy, has the form of expression (2) the spectrum of 
mctions such as expression (1) can be evaluated, in principle, 
; direct multiplication and summation of all resulting tones of 
© same frequency, with due regard to coherence among 
oups of terms. This was done by Lewin? in a penetrating 
walysis of the spectrum of the mth power of a band of white 
vise. However, the labour of applying similar methods to the 
ore complicated distortion products appearing in practice 
ould be extremely formidable. What is shown in the present 
iper is that after some preliminary rearrangement the problem 
n be reduced to repeated application of two simple rules of 
gebraic manipulation, both of which can be programmed into 
digital computer. 


(2) FORM OF THE DISTORTED OUTPUT FROM AN 
AMPLIFIER-LIMITER-DISCRIMINATOR CHAIN 

The distortion products we wish to evaluate arise from phase 
id amplitude non-linearities in amplifiers and discriminators. 
ie amplifier amplitude characteristic will normally be approxi- 
ately constant over the significant r.f.“band. Constant and 
lear terms in the amplifier phase characteristic will merely 
oduce respectively a phase shift of the carrier and a delay on 
e intelligence; thus the major portions of such terms can be 
btracted off initially. Consequently, the amplifier phase and 
aplitude characteristics can be written respectively as 


dw, + w) = of eas ese dG) 
1+ p(w, + w) =1+ S. bi) Le 


iere the a’s and b’s are of such magnitudes that the output 
stortion is small. The discriminator network will normally 
ve a substantial linear term, its function being to generate 
opy of the frequency modulation in the amplitude modulation 
the outcoming wave. Thus, the discriminator phase and 
plitude characteristics will be written as follows (where 
pital letters are used they are intended as a reminder that the 
ects of these terms are not small): 
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be +a) = E a(S)’ Y bee cin (5) 


Gar 


In practice, amplitude modulation generated by the amplifier 
will be largely suppressed by a subsequent limiter. Under these 
circumstances, it can be shown (Section 6.1) that the output 
of the discriminator takes the approximate form 


p’(we + w) = Bo + > w(2). . 6 
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bi 
YS yt 12 
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b 3 Lf IV 
+ aoe — 450i ,ah2 — 20w}, wh, + 6bwywhy 
c 


+ 15a ,wlt + 10wH?) +... | (7) 

In this expression, only the distortion terms incoherent with 
the wanted signal have been retained. The superscripts attached 
to the wa, terms denote differentiation with respect time. 
Thus wl, signifies (d?/dt?)wyy and w7 signifies (dw,,/dt)°. 


(3) REDUCTION OF EXPLICIT FORM OF FOURTH-ORDER 
DISTORTION DUE TO TERMS OF THE DISCRIMI- 
NATOR AMPLITUDE CHARACTERISTIC UP TO SIXTH 
POWER 

In an earlier paper! the explicit form of the third-order distor- 
tion associated with the discriminator amplitude characteristic, 
taking account of terms in this characteristic up to the fifth 
power, was evaluated by direct integration. Consideration of 
the details of this analysis will quickly make it plain that no more 
complex case is likely to be attempted by the same method. The 
range of the technique advocated in the present paper 1s, however, 
limited only by the availability of sufficient digital-computer 
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running time. The method will best be illustrated by an example, 
which will be taken as the reduction to algebraic form of the 
fourth-order distortion due to terms of the discriminator ampli- 
tude characteristic up to sixth power. With a minor modifica- 
tion, the result can also be made to give fourth-order distortion 
due to corresponding terms of the amplifier phase characteristic, 
as explained in Section 4. 

The terms in expression (7) associated with the discriminator 
amplitude characteristic which contribute to fourth-order dis- 
tortion are 

b; bé 
at + 8 


(8) 


w8, — 45wr wh? — 20} wht) . 
If wy, has the fe given by eqn. (2) and ®, < Oy, wh, is 
contained in w%, with a multiplying factor 150% (Reference 3 
and Reference 15, Section 8.2). Thus, the fourth-order distor- 
tion terms become 

by 7 56 
wi + 15w% wom Sore 45 
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w8 
ze he — 454 wh? — 20w2 wi] . (9) 
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In view of eqn. (2), this is 
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+ 20 ge a COS (wat + Por) 3 acos (w,t + Pa;) 


@q=1 


se aS (w,t + dos) e At cos (w,t + do, 


@g=1 @=1 


(10) 


®,, has been taken here as much less than & @,,, and put equal to 


unity. Expanding expression (10), the fourth-order distortion 
becomes 


at b6 on Om Se {LA 45(+ 4 
8 08 «4 eal aay 3 [ ale Cz w,)(+ @,) 
+ 2002] cos [(w, + w, + w, + w)t + bag + dor + das 
ae doi] 
(11) 


where the signs are taken consistently in all possible arrangements. 

To evaluate the spectral distribution of expression (9) we have 
to put the combination frequency w, + w, + w, + @; = Wm, 
where w,, is a general base-band frequency. Many of the tones 
in expression (11) add in coherent groups. To take account 
of this the summations are rewritten with w, > w, > w, > w, 
to ensure that the tones then appearing are mutually incoherent. 
If in expression (11) the full ranges of the variable frequencies 
were replaced by ranges restricted in this way, with no other 
modification, most of the tones would be excluded. All excluded 
tones, however, would be coherent with one or other of the 
retained tones. Thus, it is possible to modify the expression for 
the tone amplitudes to take account of the coherent groups. The 
procedure is considered in detail in Section 8.2 of Reference 15. 
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It is found that, after grouping coherent terms, expression ( 
brvones 
at a cali 2 2 

= SOS OY (0244 + 1202+02+02+-0?) 


8 @q=4 @,=3 Ws=2 a=1 } 
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where again the signs are taken consistently in all possill 
arrangements. ;| 

Since all the tones in expression (12) are now incoherent, t] 
total contribution at frequency w,, can be found by taking 
root of the sum of the squares of the amplitudes of all ton} 
such that w, + w, +W, hw, = Wy, Of —W,. This conditid 
implies limitations on the upper limits of the summations, whii) 


w,, @, and w, (cf. Reference 15, Section 8.2, where the simp] 
third-order distortion case is examined in detail). To avo} 
such considerations we allow w,, w,, w, and w, to cover 
full range while introducing a discontinuous factor which w) 
count the contribution of a tone if its frequency is w,,, or —@| 
and reject it otherwise. The factor required is evidently tl 
Dirac 5 function, which might, for example, be introduced 
its Fourier integral form,? i.e. 


1 co 
d(x — p) = a J el—P)zdz 


or as an integral of Bessel functions [obtained by differentiati 
the Weber step function (p. 406 of Reference 20)]. For th 
present purpose, the most economical course (see Section 6.! 
appears to be to take the discontinuous factor in the form of 
contour integral previously used by Bennett (p. 609 ¢ 
Reference 1). It can be shown, following the same line ¢ 
reasoning as in Reference 1, that the total contribution < 
frequency w,, which can be extracted from expression (12) is 
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In this expression the summations in expression (12) have 
been changed to integrals, and as before, the signs in the first 
set of multiple integrals must be taken consistently in all possible 
arrangements. The contour integration is taken along the real 
axis from —©O to ©, with a downward indentation at the origin. 
In order to evaluate expression (13), the following reduction 
formulae are required (Reference 23, p. 127, and Reference 1, 
p. 609): 


[ area =e ea me ae: te... + (—1)77! 
! 
OLE, yr | (14) 
a 
eIxZ Qajmxm—1 ] 
— = 0 
and ) zm dz (m = D! for xm (15) 
= Omtornx.<<10 ; 


The expressions in the curly brackets in each of the ten integrals 
of expression (13) must first be squared and the corresponding 
terms collected. In the present example this was done by hand, 
although in a higher-order case it would be worth considering 
the use of a computer, even at this stage. The successive 
integrations, up to the final contour integration, will now generate 
terms of the form 


ajP Awww? exp [j(Aw, + pw, + vw 


3 +'Se,)| ~ (16) 


where the eleven Greek letters are all numbers determined by 
repeated application of eqn. (14). 

It is apparent that the evaluation of eqn. (13) in algebraic 
form has been reduced to the mechanical repetition of a set of 
numerical operations. The number of terms to be processed 
would present a formidable task to a human operator (after 
the initial squaring, we start with 46 terms in each integral, 
increasing to about 130 with the first integration alone), but can 
be handled with no special difficulty with the help of a digital 
computer. Some notes on the programming procedure are given 
in Section 6.3. The final result emerges as 
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(4) COLLECTED EXPRESSIONS FOR DISTORTION IN 
TOP CHANNEL 
In Table 1 are collected the formulae for the various orders 
of distortion in the top channel (where distortion is generally 
worst), so far as they are at present known. Fourth-order 
distortion due to the discriminator amplitude characteristic was 
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evaluated in the previous Section; second- and third-order 
tortion from the same source was evaluated by hand, using t 
method given in Reference 15. The corresponding orders 
distortion due to the amplifier phase characteristic can 
deduced by noticing that the expression in the first sque 
bracket of expression (7) is (apart from nomenclature cha 
and changes of sign) the time derivative of the expression — 
the second square bracket. This Table extends Table 1 
Reference 15—in which there is a misprint in the fourth en 
from the top, where 0-47 should be 0-047. 
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(6) APPENDICES 
(6.1) Derivation of Expression (7) 


An expression from which the term of expression (7) in the first 
of square brackets can be derived is given in Reference 9. It 
s not then proved for a general modulation, but inferred from 
> form of the phase-modulation distortion when the wanted 
\dulation is a single tone. The term of expression (7) in the 
ond set of square brackets was given in Reference 15, the proof 
‘olving the general expression for the spectrum of an f.m. 
ve with multi-tone modulation given by Cherry and Rivlin.”! 
this Section an outline is given of a direct derivation of both 
ms. 

Consider first distortion due to an amplifier characteristic, the 
mittance corresponding to the characteristics shown in eqns. (3) 
d (4) is 
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the modulated wave and a delay of the modulation, respectively. 
en the output of the network can be written2? in the form 
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After some manipulation this reduces to 
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where X = cos 4, and Y = sin p,. 

To a first-order approximation the phase-modulation dis- 
tortion is given by the imaginary term in the curly brackets. 
This is 
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where the superscripts denote differentiation with respect to time. 
Performing the indicated differentiations and differentiating the 
whole expression with respect to time to recover the frequency 
modulation yields the term in the first square bracket of 
expression (7). 

The distortion due to a discriminator characteristic follows 
in a similar way. The admittance is now 


By ScOn fee ) 
| | non SG AG 
(Zo oie > ae ) expi( > ait 
B ier De 
~ By +—w+ Y ar +j(B + Zw) x 
We n=2 wh 
B, co b, c. 
Bo + “oo” a 
ane 2 wy J ae 


Applying the frequency-modulated wave to this admittance, the 
output wave can be written in a form similar to expression (18). 
To a first-order approximation the amplitude modulation will be 
given by the real term of the factor multiplying exp [/(w.t + »)], 
in the expression obtained in this way. This turns out to be 
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from which, after carrying out the differentiations, there emerges 
the term in the second square bracket of expression (7). 


(6.2) Alternative Form of Expression (13) Using Generalized 
Function Theory 


Considering, for example, the tones of expression (12) whose 
frequencies are of the form w, + w, + Ws + W,, We require to 
extract tones of frequency w,, from 
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where f(w,, w,, ws, w,) is defined in expression (12). Since all 
contributions will be incoherent, the total amplitude, F(w,,, ®,,); 
of the tone of frequency w,, is given by 
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d(w, + w, + @, + @; — Wm) do, 


The 6-function has the property that it is unity when Oy + Oy + 
@, +; =,,, and zero otherwise. Taking S(w, +o, + 
w, + w, — w,) in the form? 
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and rearranging the integral, we find that 
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The right-hand side of eqn. (21) has a very similar appearance 
to that of the multiple integrals in eqn. (13), and in carrying out 
the evaluation the four inner integrals are, of course, dealt with 
in the same way. The final integration with respect to z, how- 
ever, follows a different course. We find from Table 1 of 
Reference 2 that 


(22) 


where sgn x = + 1 for x >0 and —1 for x <0. 
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Eqn. (22) here replaces eqn. (15): it has the disadvanta) 
that all terms in the final integration must be retained, where} 
many terms drop out when eqn. (15) is used. This feature | 
eqn. (15) has the important practical consequence (Section 6) 
that families of terms may be dropped in the earlier integratior) 
Of course, the use of eqn. (22) must ultimately lead to the sar) 
result when all terms are collected, but it appears that eqn. ( 1 
presents the more economical course. 


(6.3) Note on the Computational Procedure 


The reduction to algebraic form of the ten integrals in exprel 
sion (13) was carried out on a Hec 2M digital computer, whi 
uses punched-card input and output. Each integral wy) 
evaluated’ separately. The coding arrangements were sud 
that each term of the form (16), specified by eleven sn 


it is necessary to pune up 46 cards. The programme Heri oH 
the operation indicated by expression (14), generating a new s} 
of cards (about 130 after the first integration) of precise} 
similar form to the input cards. Since the programme wil 
designed to carry out each integration using a specified variab) 
of integration, it would now be possible to return these card 
to the machine without further processing for the second rou 
of integration. If this were done, rather more than 7000 carq) 
would emerge from the fourth integration. To avoid this, th 
output cards at each stage were sorted in a high-speed so 
into coherent groups, which were then added using a secon} 
programme. Besides reducing the number of cards to managi 
able proportions, this procedure allows one to throw awal 
groups of terms in each stage, since, in view of the secon} 
alternative of the final contour integration [eqn. (15)], it becom¢ 
possible by inspection of the exponential index to predict th¢ 
these terms will make no final contribution. After the fourt 
integration and a final sorting, a further programme perform 
the operation indicated by eqn. (15). 
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SUMMARY 


The paper derives an analytical expression for the describing function 
nd for the Fourier coefficients corresponding to saturating systems 
yhich are excited by sinusoidal signals. The saturation characteristics 
re assumed to be symmetrical and single valued. The method 
avolves an approximation of the actual system characteristic by means 
if an exponential curve. 

| The results are given in the form of graphs of the fundamental and 
hird-harmonic components of the system output and a graph of the 
ystem describing function. The latter will be of use in the analysis 
)f non-linear feedback systems. A comparison is shown between the 
oresent method and the technique of approximating saturation charac- 
‘eristics by means of straight lines. Formulae are given by means of 
Which the higher-order harmonic components of the system output 
‘may be computed. 

| Secondary results of this paper are Tables of the modified Struve 
functions L;(z) and L2(z) which are not available elsewhere in the 
literature. 


LIST OF SYMBOLS 


A = Output of a system when completely saturated. 
a,(z) = (m + 1)th term of the series for L,(z). 
a, = Fourier coefficient for the nth harmonic of the system 
output. 
B = Amplitude of the system input sine wave. 
c = Constant such that Ac is the small-signal gain of the 
system. 
= Modified Bessel function of the first kind of order v 
for an argument z. 
k, m, n = Summation variables. 
L\(z) = Modified Struve function of order v for an argu- 
ment z. 
= System describing function. 
se Integral involved in the derivation of Fourier 
coefficients. o 
x, y = The system input and output variables. 
T(z) = Gamma function for an argument z. 
' @ = System input angle. 


Iz) 


(1) INTRODUCTION 


A commonly occurring phenomenon in all types of physical 
system is that of saturation. In the field of electrical engineer- 
ng, for example, saturation of electronic amplifiers, magnetic 
saturation and dielectric saturation are commonly encountered. 
The analysis of systems containing such non-linearities is neces- 
sarily difficult since the principle of superposition is not valid. 

In electrical systems, devices having saturation characteristics 
ire often subjected to sinusoidal excitation. As a result, the 
Jutput of the device will contain harmonic components as well 
4S a component having the same frequency as the input excitation. 
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A knowledge of the amplitudes of these various components of 
the output may be of considerable interest as, for example, in 
a.c. machines. In the field of feedback control the analysis 
of systems which contain saturating devices is commonly 
accomplished by the use of a describing function,! which, for a 
saturating device, and also for any other non-linearity, is defined 
as the ratio of the amplitude of the fundamental component 
of the output, expressed as a complex number, to that of the 
input. It is assumed that the input to the non-linearity is purely 
sinusoidal. 

Since practical saturation characteristics are not known in 
exact analytical form, the most accurate method of determining 
the amplitudes of the various components of the output in 
response to a sinusoidal input is to perform a Fourier analysis 
by a numerical method. This method has the disadvantage of 
being quite tedious even if a relatively small number of ordinates 
is used. An alternative method which may be employed is to 
approximate the actual saturation characteristic by a curve which 
may be described analytically. The Fourier analysis of this 
approximate curve may now be performed exactly. Theresultant 
Fourier coefficients are then used as approximations to those 
corresponding to the original saturation characteristic. 

The paper is concerned with an approximation method of this 
type which is able to give a close estimate of the fundamental 
component of the output with a minimum of labour. The 
method is thus well suited to the describing-function analysis 
of feedback control systems containing saturating devices. 
Calculation of the third-harmonic component of the output is 
obtained equally easily by use of a graph given in the paper. 
Higher-order harmonics may be computed using formulae which 
arealso given. Itis to be noted that the saturation characteristics 
discussed here are symmetrical and single valued. 


(2) APPROXIMATIONS TO SATURATION 
CHARACTERISTICS 

There are many analytically describable curves which could 
be used to approximate an actual characteristic. While the 
paper is concerned with one particular curve, it would seem 
worth while to consider several other possible curves with a 
view to comparing their utility. 

Four possible curves will be considered, namely 


(a) 3-segment straight line. 


1 1 
PTAC a IS 
c c 
1 
y=A poe ari SRS ARE ELS (I) 
y=-A ees 
c 
(b) Hyberbolic tangent. 
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(c) Froelich’s equation (hyperbola). 
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a. 
(d) Exponential. 
y= + All — exp(—cjz|)] « 2) oh }|) 
where y>O x>0 
fy <0 aa 


It will be noticed that the four curves considered have been 
chosen such that only two parameters must be specified in order 
to define a particular curve. These two parameters are A, the 
maximum value of the system output y, and c, where Ac is the 
slope of each of these curves at the origin. Graphs of each of 
the four curves are shown in Fig. 1 for positive values of x, the 
independent variable, and for A = 1,c = 5. 


O-4 


(e) tee | 
10) ore 0-4 O-6 o's 1:0 


Fig. 1.—Comparison of approximating curves. 


(a) Straight line. 

(ob) y = tanh 5x. 

(c) y= x/0'24+ x. 

(a) y = 1 — exp (—5x). 


Because only two parameters are needed to define a specific 
curve, it is relatively easy to approximate the given saturation 
curve by one of the curves shown in Fig. 1. Knowledge of the 
maximum value of the system output and of the system gain for 
very small signals are sufficient to determine a specific curve, 
although in some cases, as discussed in Section 5, the value of 
Ac may be chosen not exactly equal to the slope of the charac- 
teristic at the origin. The use of approximating curves which 
require more than two parameters to be determined will enable a 
closer approximation to be made, although, of course, this will 
be at the expense of ease of analysis. 

A comparison of the curves in Fig. 1 shows that the straight- 
line approximation will be most suitable for systems which 
have very sudden limiting of their output. This approxi- 
mation for a saturation characteristic is widely used in the 
describing-function analysis of non-linear feedback control 
systems. The Fourier coefficients for the fundamental and 
third-harmonic components of the output of a system having the 
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straight-line approximation and excited by a sinusoidal input a} 
readily availabie in the literature.” 

The hyperbolic-tangent approximation gives a curve ; 
relatively rapid saturation and hence could be a useful approx) 
mation for many physical systems. To the author’s knowledg 
the appropriate Fourier coefficients for a system having | 


ture. 
such coefficients. 

Froelich’s equation is well known as an approximation fc 
magnetization curves. As may be seen from Fig. 1, the cury) 
approaches saturation very slowly. A method of obtaininj 
the Fourier coefficients corresponding to this curve by a serie 
summation method is given in Reference 3, which also list 
some additional 2-parameter approximating curves for sat Z| 
tion characteristics. é | 

The exponential _ approximation, which is to be dischseaaa i 
the paper, is seen to lie between Froelich’s equation and thj 
hyperbolic tangent curve so far as rapidity of saturation ij 
concerned, This approximation is suggested in Reference 3, bu} 
no Fourier coefficients in closed form are given in the literaturi 
to the knowledge of the author. This approximating curve ij 
of a shape which makes it useful for the approximation of man 
practical saturation characteristics. An example of its use fou 
the approximation of a typical electronic-amplifier characteristiq 
is given in Section 5. 


(3) DERIVATION OF FOURIER COEFFICIENTS 
From eqn. (4) the saturation characteristic is to be described by: 


y= + All — exp (—clx))]' 2 ae i) 
where eSSTUN ee esl) 
Vi Oeeea0 

For a sinusoidal excitation 
x= Becost' >. — 6) 


The Fourier series for y(@) is of the form 
y(6) = a, cos 8 + a;cos 30 +...) 1 5 ae 


The Fourier coefficients may be evaluated by consideration of 
that portion of (6) which lies between 0 = 0 and 6 = x2. 
In this region 


y(@) = A[1 — exp (—cB cos 9)] (8) 
Hence the Fourier coefficients are given by 
4A ra 
a =| exp (— cBcos @)] cosnfd@ . . (9) 


where n is odd. It is shown in Section 8.1 that a, may be 
expressed in terms of the following finite series in which z = cB: 


in = 2A[T (2) — Ly] + 


1 
ze a4 Ss (—1)&+2)/2 
VT k=4,6.. 


(n* — 1*)(n? — 32)... [n? 
(K—2l2 
k — 15 ia 


x [In — Lan] 


k-3yr( ao *) 


x 


(10) 


Expressions for the first- and third-harmonic coefficients are 
a; = 2A(K(CB)—L(cB)] . . . . At) 
8A, 
a 2A [1,(cB) == L,(cB)] a a platcB) ae L,(cB)| 
8A 
aS noe a pllacB) ale L,(cB)] C ci C “ 5 & (12) 


he describing function for the non-linearity is then given by 


2A 
N= = [NcB)—Ly(cBy] . 13) 


(4) COMPUTATION OF FOURIER COEFFICIENTS 


_In order to compute the Fourier coefficients, values for the 
ametions L;/. and I,;. are required for a suitable range of the 
(Tgument z = cB. Since the coefficients a, and a; are of greatest 
aterest, computation of L,(z), L,(z), I,(2) and I,(@) only is 
quired here. As shown below, values of the higher-order 
junctions required for the computation of further Fourier 
joefficients may be readily obtained by means of recursion 
ormulae or by summation of a series: 

| The only modified Struve functions which appear to be 
abulated in the literature are L_ (Zz), L_;(z) and Lo(z), as 
tiven in Reference 4. Now in general it is true that 


4-12 tui = +|(Q)/ro+are] . a4 


if v = O, this reduces to 


2; 
Li =m a + L_,@) . a) pie . (15) 
and if vy =1, 


2) 22 
4 


L,(z) = Lo(z) = 


(16) 
Using eqn. (15), Table 1 of L,(z) was computed for z 
‘anging from 0 to 5 in intervals of 0-1. z= 5 corresponds to 
the output of the non-linearity reaching to within about 0:7°% 
of its saturated value. Because the values of L_,(z) used were 
labulated to 8 decimals, so also were those of L,(z). The 
maximum error is +1 in the last figure. 

Values of L,(z) shown in Table 2 were computed for the same 
range of z using eqn. (16), Table 1 and the Table of Lo(z) in 
Reference 4. Because the operations of division and subtraction 
in eqn. (16) can lead to loss of several significant figures, L,(z) 
was rounded off to four decimal places which should be sufficient 
for most engineering work. 

In order to compute L,(z) for v > 2, the recursion formula of 
2qn. (14) may be used or the following series may be summed: 


ie aaa pee ahd We SACLE) 


m=0 
(4z)2m+1+y 
Ton + 90 + +9 


Values of the function I,(z) are more readily obtainable. A 
useful Table for the purposes of this paper is given by Dwight,° 
Where I,(z) is tabulated to five significant figures for integral 
values of v from 0 to 11 and z from 0 to 6 in intervals of 0-1. 

Calculated values for the Fourier coefficients a, and a3 are 
siven in graphical form in Fig. 2 where a,/A and a;/A are plotted 
gainst cB. These graphs should be sufficiently accurate to be 
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where Anz) = (18) 
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VALUES OF THE MODIFIED STRUVE FUNCTION L,(z) 


Li@) 


Li@) 


epoooo oooodceo 
SoWwoana ABWNH O 


— ek ee 


SOwmID AKRON 


NBWYMNYD NKR ee 


WABWNe 


0-000 000 00 
0-002 123 48 
0-008 510 93 
0-019 213 48 
0-034 31688 
0-053 942 19 


0-078 246 81 
0-107 425 88 
0-141 71392 
0-181 38685 
0-226 764 38 


0-278 21275 
0-336 147 87 
0-401 038 94 
0-473 41249 
0-553 85691 


0-643 027 57 
0-741 65246 
0-850 538 57 
0:970 57879 
1-102 75979 


1-248 17052 
1-408 01174 
1-583 606 50 
1-776 41169 
1-988 03077 


Table 2 


WWWWWw 


ABPHAH AHAADBD Pwwnwnw 


lan ole oe Koy) ABRWNRe 


2220227 88 
-474 943 22 
2-754 31021 
+060 674 15 
*396 61295 


3+ 764.959 82 


+168 828 31 
611 639 84 
097 153 98 
629 501 79 


+213 222 46 
+853 303 67 
*555225 94 
*325011 47 
16927781 


095 29699 
-11106045 
-225 35061 
-447 81951 
~789.075 37 


-260777 81 
-875 742 64 
648 05718 
-593 207 04 
+728 21578 


VALUES OF THE MODIFIED STRUVE FUNCTION L,(z) 


La(z) 


L2(z) 


— eee ee 


©'oo~1 0" URWNR © 


SSMAD ALONE 


NRNNNLY Nee 
ABWN 


0-000 00 
0-000 04 
0-000 34 
0-001 15 
0:0027 
0-005 4 


0-009 3 
0:0149 
0:0224 
0-032 2 
0-044 5 


0-0598 
0-0785 
0:1010 
0-1278 
0-1594 


0-196 3 
0-239 1 
0-288 6 
0:3454 
0-4103 


0:4842 
0:5680 
06629 
0-7699 
0-890 3 
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0255 
1771 
-3467 
5362 
‘7477 


— ee 


9833 
-2456 
+5373 
-8614 
223 


-6205 
063 0 
+5534 
0965 
+6976 


+3626 
098 2 
“9113 
-8101 
+803 0 
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Fig. 3.—Graph of the describing function. 
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useful for engineering purposes. For use in the analysis of 
non-linear feedback control systems, a plot of the variation of 
N/Ac with cB is given in Fig. 3. This graph should have ade- 
quate accuracy for such a purpose. 


(5) CURVE FITTING 


The utility of the method described in the paper is dependent 
on a procedure for approximating the actual characteristic by 
the exponential curve. A description of the curve-fitting pro- 
cedure in an actual application of the method will reveal many 
of the points to be considered. 

The problem to be considered is that of obtaining a plot of 
the variation of the describing function with the input amplitude 
for an electronic amplifier whose characteristic curve is shown 
in Fig. 4. Only the curve for positive x is shown here. The 
corresponding curve for negative x was symmetrical with dif- 
ferences of such small magnitude as to be negligible. By actual 
measurement from the characteristic it is found that A = 47 
and c= 3°7. 
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Fig. 4.—Various approximations to the amplifier characteristic. ~ 
(a) Amplifier characteristic. 
(6) Straight line, c = 3-7. : 
(c) Exponential, c = 5. 
(d) Exponential, c = 3-7. 


The selection of a suitable exponential curve must now be 
considered. The logical choice would appear to be an exponen: 
tial curve for which A = 47 and c = 3:7. This is a charac 
teristic such that its small-signal gain, namely 174, is the same 
as that of the actual amplifier. However, by reference t 
Fig. 4, it is seen that this exponential is not a very good fit 
in the sense of maximum vertical distance between it and the 
amplifier characteristic. Whether the exponential having th« 
same values of A and c as the actual curve is a good fit depend: 
entirely on the shape of the actual curve. If the curve had, b; 
chance, been of exponential shape a perfect fit could, of course, bt 
obtained. It will thus be seen that those characteristics which are 
best approximated by the methods of the paper will be thos 
having approximately exponential shape. 

Consideration will now be given to the question of whethe. 
a better fit, in the sense mentioned above, could be obtained by 
an exponential curve with A = 47 but with a value of c othe! 
than 3-7. Since this question is likely to arise in most applica 
tions of the method of the paper, curves of 1 — exp (cx) versus 3 
have been given in Fig. 5 for various values of c. To use thes 
curves it is suggested that the normalized system output (i.e 
actual output divided by the saturation-level output) be plottec 
against the normalized system input (i.e. actual input divided bi 
the maximum input) on tracing paper to the same scale as tha 
of Fig. 5. By superimposing this trace on Fig. 5, it is a fairl 
simple matter to pick a suitable value for c. 

In Fig. 4 are plotted exponentials with A = 47 and c =: 
and c = 3-7, and the original characteristic curve. The fi 
between the exponential with c = 5 and the characteristic curv 
is judged to be about the best. The question of how logically t 
choose the best exponential is a difficult one to answer. Th 
criterion by which the fit is judged must surely be some functiot 
of the difference between the Fourier coefficients correspondin; 
to the approximating curve and those corresponding to the actua 
curve. How this function (if a suitable one was chosen) couk 
be related to the fit between the two curves is a problem whicl 


Or 
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Fig. 5.—Exponential curves for various values of c. 
y = 1 — exp (—ex). 


1e paper does not attempt to solve. In choosing the value of 
= 5, however, the following intuitive criteria were used: 


(a) The vertical distance between the two curves must be kept 
) a minimum at all points. 

(b) The net area between the two curves should be as small as 
ossible. 

(c) The slopes of the curves near the origin should be as close 
s possible. 


The results obtained using the exponential with c = 5 seem to 
istify this approach and it is suggested that, in lieu of any better 
riteria, the above method will give satisfactory results in most 
ASES. , gayest 

The variation of the describing function N with B is given in 
ig. 6 for the actual curve and for various approximations to the 
urve. The describing function for the true curve was computed 
y a numerical Fourier analysis, and those for the two exponen- 
al curves with c = 3:7 and c = 5 were computed using Fig. 3. 
‘he difference in effort required to obtain N by these two methods 
; an indication of the usefulness of the approximation technique. 
Iso shown in Fig. 6 is the describing function for the straight- 
ne approximation shown in Fig. 4, with c = 3-7, which is the 
pproximation commonly used in non-linear control-system 
nalysis. 

By comparison of the various curves in Fig. 6 it may be seen 
nat the describing function for the exponential with c = 5 is 
efinitely much more accurate than that with c = 3-7 except for 
»w B. A comparison of the former with the straight-line 
escribing function again shows that the exponential approxima- 
ion gives a considerable improvement in accuracy except for B 
»ss than about 0:15. The reason for this discrepancy at low B 
; the fact that the slope of the exponential for small B is greater 
han that of the actual curve in this region, 
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Fig. 6.—Comparison of describing functions for amplifier. 
(a) True curve. 
) Straight line, c = 3-7. 
c) Exponential, c = 5. 
(d) Exponential, c = 3-7. 


A great improvement in the accuracy of the describing function 
of the exponential for low values of B can now be obtained by 
recognizing that the describing function for any smoothly 
saturating curve cannot exceed Ac. Hence, in this case, an 
excellent approximation to the actual describing function is 
obtained by using the exponential describing function for c = 5 
for B greater than or equal to about 0:14 and the straight-line 
describing function for B less than this value. The resulting 
2-section describing function has a maximum deviation from the 
true describing function of only about 7% as compared with a 
deviation of about 21% for the straight-line approximation, 
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This procedure of constructing a 2-section describing function 
is always possible where the value of c chosen for the exponential 
using Fig. 5 is greater than that corresponding to the actual 
characteristic. 


(6) CONCLUSIONS 


The mathematical derivation of the paper results in a relatively 
simple method for the determination of the describing function 
of a physical device which is subject to saturation. The method 
is also useful for the computation of the Fourier components of 
the output of such a device when driven by a sinusoidal input. 

The particular approximating curve is only one of several 
possibilities. The derivation of the Fourier components corre- 
sponding to alternative approximating curves would be a field 
for further study. In addition, an investigation of possible 
criteria for the curve-fitting process could yield useful results. 
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(8) APPENDIX 
Derivation of Fourier Coefficients 
From eqn. (9) 


7/2 
i [1 — exp (—cB cos 6)] cos n6d0 
0 


4A 
a, = — (19) 


where 7 is odd. Separating the two terms of the integral gives 
44 pul? 44,7? 
oo | cos nOd@ — — [ exp (—cBcos 6) cosnfd0 . (20) 
Giesg Tso 


The first term of this expression when evaluated is 


es poppet 
nT 


Consider next the integral 


m/2 
P = | exp (—cB cos 8) cos n6d0 (21) 
0 
Tt follows that 
7/2 i 1/2 
P= [ cosh (cB cos @) cos n6dé — [ inh (cB cos 6) cos nOd6 
0 0 
ae: as. (22) 
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Integrating by parts 


n 


: T/2 
P= cosh (cB cos @) ae | 
0 


7/2 


sin nO 
+ cB i 


0 


sinh (cB cos 6) sin 6d0 


n 


| ; sin oh f 
— | sinh (cB cos 8) 
0 


me /2 


x co ea cosh (cBcos@)sinéd6 . . @ \ 


n 


0 Ni 
In this expression the third. term vanishes and the first tern) 
becomes ‘| 
(—1)e-DP 

n 


for the case where n is odd: 


2. 42 
n(n we 


a n? 6 


sinn? = nsin 0 — 


Dye og Phe) a 3b} 
if n(n ae 3 ) ns r 


oe) a 
It follows that 

: : De 
sinnO sin? 29 _@ 17) nt 6 
bi es ON SOME 

(P= PEP peg 


(ne? — 12\(n? — 32)... [n?@-&—3)?] | 
(—1)&+22 iD! sin’ 0. .). (2am 


Hence P may now be written as 


a 


—)@-D/2 wiz 
P= Gen ae cB | sinh (cB cos 0) sin? @d0 
0 

n+1 n/2 , 
+ cB [ (—1)&+22 A 
k=4,6...°0 a 

(n2 — 1*)(n? — 3?) .. . [n? — (k — 3)?] sinh (cBcos 8) sin* 6 
x k— DI dé 


7/2 
= cB | cosh (cB cos @) sin? 0d0 
0 


an fC 
k=4,6...~0 
e (n2 — 17)? — 32)... [n* — (k — 3)?] cosh (cBcos 8) sink 0 a 
: (k—1)! 
(26 


Now the modified Bessel function of the first kind of order v is 
given by® 


2(z)/2)v, 77 


Iz) = Semen cosh (z cos 9) sin?” 640 (27 
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|1nce, by rearrangement and comparison with the above expres- 
ns, P may be written as 
i (—1)@-ve 


(28) 


7 Ti 
+ 5hi(cB) —3 Li(cB) 


St DE MPG? — 10? — 34)... [@?—&— 3) Je 


k=4,6... (eat)! 


| re/2 r/2 
: I Bah (eRicos 6 \sin® 0d0 — [ cosh (cB cos 6) sin* a0 | (29) 
0 0 


Looe 
n 


+ F[Li(cB) — 1(cB)] 


— 1?)(n? — 3?) .. . [n* — (k — 3)*]cB 
(k — 1)! 


n+1 (n2 
Aust bY (—1)6+/2. 
k=4,6... 
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Substituting this expression for P in eqn. (20), and noting that 
TQ = \/7, a, may be written as 


a, = 2A[I,(cB) — L(cB)] 


+1 
Sper 
k=4,6... 


(n? — 12)(n? — 32)... [r? — (k — ayn(=e") 


(pe Dieales 


[Lij2(cB) — Lyjo(cB)] (31) 


DISCUSSION ON 


‘EFFECTS OF ARGON CONTENT ON THE CHARACTERISTICS OF NEON-ARGON 
GLOW-DISCHARGE REFERENCE TUBES’* 


BEFORE THE ELECTRONICS AND COMMUNICATIONS SECTION, 23RD MARCH, 1959 


Mr. A. J. Young: The authors point out that the influence of 
rgon on glow-discharge tubes has been studied for a long time. 
hese effects were, of course, known in Germany in the 1930’s. 
o far as I know, however, there has been very little published 
formation before this series of papers. I think now we have 
omething we can use on a subject which has been studied in 
1ore detail. 

I am not surprised that the authors found great variations 
ith the film-coated tubes. After all, the film is very thin and 
fhen one realizes the difference in surface conditions that one 
ets, even with a thermionic cathode, even greater variations 
ould be expected with these extremely thin, highly reactive films. 

The authors conclude that the most desirable characteristics of 
eon tubes with molybdenum cathodes are obtained when the 
rgon content is about 1:0%. I do not think that is right. For 
xample, the plateau of the curve in Fig. 2 is obtained at about 
%. If the voltage is set at that point, small variations of argon 
ill not make very much difference. On the other hand, at 1% 
le are almost starting on the slippery slope. 

In Fig. 3, there are no obvious jumps or steps at 1% of argon, 
ut after all, 0-5°% is nearly as good. With that content, and 
the steps are caused by the discharge jumping over the edge 
f the cylinder, then probably they could be eliminated by 
djustment. As regards the temperature coefficient shown in 
ig. 4, we agree that 1% is the best. Against that, however, 
1ere is only a change of 200 millivolts with a 50°C change 
t 0-5%. In Fig. 6, initial running-voltage drift shows little 
ifference between 0-5 and 1:0%. 


* BENSON, F. A., and CHALMERS, P, M.; Monograph No, 321 R, December, 1958 
ee 106 C, p. 82). 


When we consider the impedance of the tube in Figs. 9 
and 10, I agree that rather more argon seems to help, but against 
that you have to bear in mind that these are reference tubes 
which are not really intended as coupling elements. For that 
purpose, within the circuit, no doubt other conditions will apply. 

In Fig. 13, minimum noise occurs at about 4%. Ifa reference 
tube is to be made using the information in the paper, perhaps 
1% is a little too high. Of course, at a higher content, of the 
order of 3% or more, the tube would clearly not be satisfactory. 

Mr. J. M. Glackin: In considering the best temperature 
coefficient there are two applications to be borne in mind. The 
first is where large changes in ambient temperature may occur. 
The change in maintaining voltage may be compensated for by 
the use of temperature-dependent circuit-elements. The tem- 
perature coefficient should therefore be as linear as possible and 
reproducible from tube to tube. Secondly, where very high 
stability is required and only small variations of temperature 
may occur, there should be no sudden changes in maintaining 
voltage due to these small changes in ambient temperature, i.e. 
no ‘thermal jumps’. 

Can the authors give more details of their methods of measure- 
ment of temperature coefficient and comment on the discon- 
tinuities in these coefficients? 

Geometry has a large effect on all the tube parameters men- 
tioned, particularly on the noise content. I have found that the 
level of gas noise is a function of gas pressure, anode area 
relative to anode-cathode spacing, and current. 

There is a small but significant difference in the authors’ 
results compared to those of Penning and others. Could this 
be due to a change in gas mixture caused by a preferential 
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gettering of one of the constituent gases during the very heavy 
sputtering used in processing this type of tube? 

Mr. J. Smith: The authors say that temperature coefficients 
can be explained by maintaining-voltage/pressure curves. Such 
a curve having a negative gradient could explain a negative 
temperature coefficient because, when the envelope temperature 
is increased, the density in the cathode dark space is increased. 
The effective pressure in the cathode dark space is therefore 
increased and the maintaining-voltage decreased. 

Can positive temperature coefficients be explained in a similar 
way by maintaining-voltage/pressure curves having positive 
gradients, or must additional factors be included in the theory? 
I have found, for example, that the temperature coefficient of the 
85A2 tube at 5:SmA, which is negative at low envelope tem- 
peratures, becomes positive at an envelope temperature of 
130°C. The envelope temperature giving an effective pressure 
in the cathode dark space equal to the pressure at which the 
maintaining-voltage/pressure gradient changes from negative to 
positive is, however, much in excess of this. 

Mr. G. C. Pope: I have been interested in the application of 
neon-argon-filled cold-cathode tubes as switching elements, in 
which higher argon contents are sometimes used. These give a 
larger difference between striking and maintaining voltage and 
hence a larger voltage can be developed into the load. With 
reference to the authors’ comment that there is a general trend 
for low-argon-content tubes to have higher striking voltages, 
we have found with oxide-coated cathodes a minimum striking 
voltage in the region of 0-5-1°% argon and the striking voltage 
rises on either side of this. 

In connection with the temperature coefficient of maintaining 
voltage, Jacobs and Martin* showed that the exclusion of 
mercury vapour from argon tubes changed the temperature 
coefficient from negative to positive. Have the authors excluded 
mercury vapour from their experimental tubes, and, if not, to 
what extent does this contribute towards their results? Do the 
metastable mercury atoms add a further term to the impedance 
in the equivalent circuit? 

Noise generation increases with argon content, as does also 
the current density. To what extent does the cathode surface 
uniformity contribute to noise generation? 

Dr. R. O. Jenkins: I have been interested in the paper from 
the point of view of corona stabilizers, the geometry of which, 
quite fortuitously, happens to be very similar. One of the types 
has a 1mm diameter anode and a 7mm cathode, and the gas 
in this case is pure hydrogen at rather higher pressures than 
those used by the authors. The voltage drop is entirely con- 
fined to the neighbourhood of the anode, where the field is 
highest and the ionization is taking place. The currents are 
generally a fraction of a milliampere. 
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Such tubes always have a positive voltage/temperature coe 
cient. This ties up well with the fact that there is a posi 
variation in the voltage with pressure. On the other hand, thi 
does not seem to be a direct correlation between the slope 
the voltage/pressure curve and the temperature coefficient. | 
you regard the operating voltage as being Vo(1 + aT), the val 
of a over a fairly wide range of pressure is fairly constant 
about 0-01 % per degC, so a 400-volt tube will have a change 
absolute magnitude of voltage for a given change in temperatt 
of only about a quarter of that for a 1 600-volt tube. The sl 
of the voltage/pressure curve at the two different points are 
very different, that at 400 volts being, in fact, the greater. 

Mr. G. F. Weston: I have been measuring the dischat 
characteristics of mixtures of helium and neon, and also 
helium and argon, with particular reference to the maint 
ing potentials in geometries aimed at giving fairly unifo 
fields. ; 

I find that in both systems of mixtures the voltage/curre 
curve depends on pressure, and, as a result, the plot of mai 
taining potential at a given current against the percentage | 
neon or argon in helium is slightly different for different pressure 
This also applies to the minimum maintaining potentials. I ha 
also found that the geometry is very important, and if you use 
point anode and a plane cathode, instead of parallel planes, y« 
again get a different curve of maintaining potential against g 
composition. 

I would stress that one must specify the geometry and g 
pressure when dealing with gas mixtures, since the results mi 
only apply to the particular conditions used. If the pressu 
and/or geometry are changed, a slightly different dependence « 
gas composition may result. 

Mr. R. E. Lake: The manufacturers’ usual method 
measuring voltage jumps is by a current sweep technique, t 
jumps being displayed on an oscillograph. 

One of the first essentials of a reference tube is, of cour: 
stability during life, and I should be interested to know if L 
Benson intends to do, or has already done, some life tests « 
these valves to examine the effect of argon content on thi 
performance, or on their characteristics during life. 

Mr. J. M. Glackin: Have the authors considered the effect 
elevated temperature on tube stability during life or shelf test 

In one of the devices we manufacture we find that stabil 
on 100°C life test is slightly better than the stability at roo 
temperature. The only explanation which we can find for tl 
is a change in argon content on room temperature life test due 
clean-up. At 100°C, however, providing the tube has be 
processed to withstand high temperatures, there is a change 
rate of clean-up due to evolution of argon occluded in t 
sputtered layer. 


“| 


THE AUTHORS’ REPLY TO THE ABOVE DISCUSSION 


Dr. F. A. Benson and Mr. P. M. Chalmers (in reply): The main 
object of the present work was to find the influence of argon 
content on the characteristics of glow-discharge tubes, and high- 
stability reference tubes were used to reduce the spread in the 
results. The conclusion that 1% argon gives the most desirable 
performance was drawn after considering all the characteristics. 
We agree, however, that the noise is not then a minimum, but 
the variation of noise with argon content up to 1% is not large. 
Mr. Young points out that, at 1% argon, slight variations in 
argon content will affect the running voltage more than varia- 
tions at 0-5%, and this is, of course, quite true. The spreads 
in the results at 1% and 0-5% argon, however, are not appre- 
ciably different and cannot be attributed to variations in the 


* Journal of Applied Physics, 1950, 21, p. 681, 


argon content alone. In Fig. 3, the 1% argon curve shows 
sensibly flat running voltage over the range 1-5 mA, while curs 
for lower argon contents have no flat regions. 

In reply to Mr. Glackin, no thermal jumps were observ 
during the present investigations, but previous work usi 
coated-cerium cathodes indicated that jumps or steps in t 
voltage/temperature curves nearly always occurred in tut 
having similar steps in the running-voltage/current charact 
istics; these are thought to be due to sudden current-dens 
changes caused by non-uniform cathode surfaces. 

Running-voltage/temperature characteristics were measur 
by immersing the tube in an oil bath of fairly large thern 
capacity, surrounded by a water jacket containing a heati 
element. Running voltage was recorded by a potentiometer 


ervals of 10°C over the range 20-90°C. At each tempera- 
e, sufficient time was allowed for thermal equilibrium to be 
iched and the running voltage to become constant. This 
tied between 5 and 10 min and the temperature did not change 
nificantly during the measuring period, when the heater 
ment was switched off. We have recently investigated the 
aracteristics of tubes containing neon plus 0:3% argon, and 
d the influence of the geometry very small. We found that 
2» noise was independent of the geometry, but increased with 
5 pressure at low tube currents. It is hoped to publish the 
sults of this investigation shortly. 
We agree with Mr. Weston that the shift between the minimum 
nning-voltage curves of Fig. 2 is more likely to be due to a 
ference in geometry than to a change in gas mixture caused 
‘heavy sputtering. If the latter were the case, the difference 
juld be more pronounced at low argon contents. The largest 
screpancy occurs for argon contents exceeding 1°% and this is 
ssibly due to the start of an anode fall in these tubes. 
Mr. Pope refers to the paper by Jacobs and Martin, who 
liberately introduced a quantity of mercury into their tubes, 
that a droplet remained after preparation. There was a 
ssibility that mercury vapour was present in the tubes 
amined, but subsequent investigations with additional tubes 
‘the same type, free from contamination, produced character- 
ics similar to those reported. We therefore concluded that, if 
ercury vapour were present in the original tubes, it was 
obably cleaned up in the sputtering process, or that the 
nning effect due to the presence of mercury atoms was com- 
etely masked by the argon admixture. 
The impedance characteristics of a glow-discharge tube are 


} CHARACTERISTICS OF NEON-ARGON GLOW-DISCHARGE REFERENCE TUBES’ 135 


more easily analysed when they are plotted in the form of 
impedance loci. The equivalent circuit can be identified from 
the form of the loci and can be regarded as a series combination 
of the static resistance (slope of static characteristic) and a 
number of parallel combinations of inductance and resistance. 
Each parallel combination has a time-constant associated with a 
particular process in the discharge mechanism. If mercury 
vapour is present in a ‘pure-neon’ tube, the lifetime of the neon 
metastable atoms will be reduced and the mercury ions formed 
by the Penning effect will contribute to the discharge current. 
Hence there will be an additional parallel LR combination in the 
equivalent circuit to account for this process. Similarly, if the 
mercury metastable atoms influence the discharge, another 
element will appear in the equivalent circuit, but it is difficult 
to see what role they would play in the discharge process. 

We are interested to learn that Dr. Jenkins has observed 
positive voltage/temperature coefficients in corona stabilizers 
which could be traced back to the positive slope of the vol- 
tage/pressure curves. We are at present plotting voltage/pres- 
sure curves of glow-discharge tubes to see whether they have 
positive slopes at high argon contents. 

Life characteristics of the tubes examined are also being 
studied, and we find that during the first 1000 hours of opera- 
tion the change of running voltage is small (0-1 volt), and 
independent of the argon content above 0-01%; during 
the first 5000 hours the running voltage decreases by about 
0:4 volt. These life tests are being performed at room tem- 
perature, the tubes being operated at a mean current. We 
have not so far carried out any stability tests at elevated 
temperatures. 


DISCUSSION ON 
‘THE SQUARE-LOOP FERRITE CORE AS A CIRCUIT-ELEMENT’* 


Mr. A. Kruithof (communicated): Dr. Lindsey does not take 
e effect of the geometry of a ferrite core into account, although 
is plays a major role at the field strengths usually encountered 
, for instance, memory matrix operation. 

Goodenough attributed the ‘Gaussian’ tail of the voltage 
itput of a core to the random distribution of domains of reverse 
agnetization.t At the field strengths which are usual in ferrite 
re Operation, however, the observed voltage waveform can be 
rived from geometrical factors without necessarily assuming 
random distribution. 

Indeed, let us at first suppose that a perfectly regular distribu- 
mn of cylindrical domains of reverse magnetization nucleates, 
r simplicity’s sake, at one critical field strength H,,. This 
icleation phenomenom seems responsible for the initial 
ike’ of observed waveforms. In Permalloy cores another 
echanism is responsible. 

After nucleation the cylindrical domains have a radius rg 
ig. A) and expand at a speed which is proportional to the excess 
the existing field strength over a fictitious field strength Hg 
presenting the retarding forces. They expand at this speed 
itil ry =r,. Then the star-shaped form changes into another 
linder with radius r,, because the cylinder has a lower energy 
mtent. This cylinder shrinks and disappears. An approximate 
ve giving Ad/Ar for this process is shown in Fig. B. 

* Linpsey, C. H.: Monograph No, 327 M, February, 1959 (see 106 C, p. 117). 


f MENyYUK, N., and GooDENOUGH, J..B.: ‘Magnetic Materials for Digital Computer 
Mponents’, Journal of Applied Physics, 1955, 26, p. 8. 
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Fig. A.—Model for core reversa]. 
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Fig. B.—Approximate elementary voltage output. 
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Fig. C.—Field strength H and reversal time t as a function of the 
radius r when H = 2H at r = Ro. 
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If no regular distribution of reverse domains is assumed, | IH 
approximate curve of A¢/At of Fig. B has to be replaced by way) 
forms which could be obtained experimentally by observing th} 
output of a core which had very approximately R,/Rp ~ 1, q 
theoretically as Dr. Lindsey has done (theoretical waveform ¢ 
Fig. 7 in the monograph). By suitably adding, one again obtait} 
the output for toroids with R,/Rp~1. The result is that at ty 
field strengths usually encountered and with, for exampli) 
R,/Ro= 1:6, a random distribution or a regular distributio}, 
makes little difference, as it is the geometry that dictates the shap 
of the voltage output. of 

Dr. C. H. Lindsey (in reply): By taking a rather idealized mo 


geometry to round off the corners, Mr. Kruithof has, not sui) 
prisingly, produced a waveform with the same general appearance} 
as mine, which was based on’a similar but not so highly idealize; 
model. 

Granted that I did not take the core geometry into accoun) 
(actually, the ratio of the outside to the inside diameter of th) 
cores that I used was 1-5), this is only of importance when t 


Fig. D.—Core voltage for a core with R;/Ry = 1-6. 


Addition of approximate A@/At curves is shown for curve 1, for which H = 1-8 Ho at r = Ro. Ja 


For curve 2, H = 2Ho at r = Ro. 
For curve 3, H = 2:2H at r = Ro. 


For a toroid the total voltage output is formed by the sum 
of all such processes going on throughout its cross-section. Near 
the inner diameter the process of expansion and shrinkage is 
faster than at the outer diameter, because H — Ho decreases with 
the distance from the centre of the toroid (Fig. C). By adding 
up a number of approximate Ad/At curves, representative of the 
different parts of the toroid, we obtain the voltage output. In 
Fig. D this has been done for a ratio of outer diameter to inner 
diameter of 1°6, which is the ratio of certain well-known 
miniature cores. 


t 


actual field is not much greater than the pseudo-coercive force. 
Presumably Mr. Kruithof had this in mind when he referred to 
‘the field strengths which are usual in ferrite core operation’. 
Presumably he was referring to the field strengths used in coin- 
cident current matrix stores, but I was careful to point out, in 
my very first paragraph, that my theory did not pretend to cover 
this case. I think the evidence of Fig. 7 shows that the particular 
approximation I have made fits the facts very well over the 
relevant range, and [ doubt very much whether Mr. Kruithof’s 
model would fit the observed waveforms so well. 
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